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To my Students 
Past, present, and future 


Foreword 


The staff of the Department of Electrical Engineering at the Massa- 
chusetts Institute of Technology some twenty years ago undertook an 
extensive program of revision as a unit of its entire presentation of the 
basic technological principles of electrical engineering. By early 1943 
this collaborative enterprise had resulted in the publication of three 
volumes of a projected series. Publication of this complementary book 
of the series was delayed by the war. 

The decision to undertake so comprehensive a plan rather than to add 
here and patch there came from the belief that the Department’s large 
staff, with its varied interests in teaching and related research, could 
effect a new synthesis of educational material in the field of electrical 
engineering and evolve a set of textbooks with a breadth of view not 
easily approached by an author working individually. 

Such a comprehensive series, it was felt, should be free from the dupli- 
cations, repetitions, and imbalances so often present in unintegrated 
series of textbooks. It should possess a unity and a breadth arising from 
the organization of a subject as a whole. It should be useful to the student 
of ordinary preparation and also provide a depth and rigor challenging 
to the exceptional student and acceptable to the advanced scholar. It 
should comprise a basic course adequate for all students of electrical 
engineering regardless of their ultimate specialty. Restricted to material 
which is of fundamental importance to all branches of electrical engineer- 
ing, the course should lead naturally into any one branch. 

Such a basic synthesis, it is felt, has been appropriately achieved in the 
first three volumes. In the course of their generation it became more and 
more evident that the development of further extensions of this basic 
material covering specialized fields would correspondingly become more 
and more the responsibility of individual authorities who could relate 
their work to the basic structure. 

The four volumes and the organized program of teaching out of which 
they have grown, are thus the products of a major research project 
to improve methods of technological education. The experience gained 
through the years in this stimulating exploration, together with the 
rich background of accelerated development contributed by the circum- 
stances of war, opens the way for further evolution in this undertaking. 
Perhaps the most interesting potentialities lie in the influence of the social 
sciences and humanities on the extensions of this vital adventure. 


J. R. Kirtan, Jr. 


Preface 


When the writing of basic text material for the undergraduate curricu- 
lum was first undertaken by the Staff of the Department of Electrical 
Engineering we felt that this material should not merely fill the immediate © 
needs of the undergraduate subjects for which it was primarily intended. 
but in addition should offer stimulation and incentive for further study. 
For this reason occasional glimpses were afforded of the horizons which 
lie beyond the artificial boundaries set by conventional limitations in the 
treatment of technological principles. The collateral discussions engen- 
dered by these aims tended to appear digressive; moreover, they became 
so numerous as to interfere seriously with conciseness, continuity, and 
clarity. We therefore determined to relegate these supplemental  treat- 
ments to an appendix. As the project developed the appendix became 
too large to justify itself; in fact, it took on the aspect of a text in itself. 
Thus evolved the concept of companion or reference volumes for collateral 
study. 

Later, when we evaluated this concept as to scope and organization, 
_ we concluded that the supplemental material would be more useful if the 
discussions of a purely mathematical nature were collated and separated 
from the applications to be predicated thereon. This book is the result 
of this effort to avoid redundance and to attain a logical unification. The 
applications themselves, then, were to appear as further reference volumes 
of the series depending for their foundations on The Mathematics of 
Circuit Analysis. 

In the course of these efforts war intervened. Through its exigencies 
there was an unprecedented acceleration of scientific and technological 
effort, compressing into a few years what normally would have taken 
- decades to achieve. Necessarily the revision program was held in abeyance 
during this critical period with the result that the Department of 
Electrical Engineering is now reconsidering the course revision program 
and modifying its prewar plans. The basic series must be reappraised in 
the light of the vast wartime developments. This may well lead to the 
addition of material to the earlier basic texts. Moreover, there must be 
drastic reconsiderations of the supplemental volumes which will form 
extensions of the basic series and complement this particular reference 
volume. Since at the moment this book goes to press the Department is 
in the midst of reevaluating the extensive revision program, it is impos- 
sible here to make precise pronouncements either as to the character or 

ut 
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number of volumes to follow. Although thus far it has been the practice 
to publish the volumes of the series without naming collaborators, since 
the immediate volume came about largely through the inspiration of 
Professor Guillemin, and since it was written entirely by him under the 
trying pressures of war, it has been decided to recognize him as author. 

As to the subject of this volume it may be well to emphasize that a 
mathematical textbook written by engineers is to be looked upon as an 
idea conveyor without claim to rigor. The discussions given herein should 
be regarded as being plausibility arguments rather than proofs. The 
primary purpose is to stimulate interest and lay a background of general 
understanding upon which the student may later build more carefully. 
In no wise therefore is the character of presentation here given to be 
looked upon as a substitute for the more formal rigorous treatment of 
the mathematician. 

Although other books of this nature have been written, in the planning 
of this book we felt that a rather complete assemblage of mathematical 
topics, needed specifically or collaterally in the analysis and synthesis of 
electrical networks and in the attack on field problems related to trans- 
mission lines, wave guides, and antennas, was still outstanding. The 
opportunity to include more detailed discussions of certain topics here- 
tofore given too little attention has been capitalized, we hope, to the 
advantage of student and researcher alike. 

In the field of advanced algebra, for example, we believed that a 
discussion of determinants and matrices becomes ever so much more 
meaningful when coupled with the geometrical interpretations provided 
by the subject of linear coordinate transformations and the closely 
related discussion of quadratic forms. Thus the first four chapters of 
this book bring together a collection of topics in advanced algebra that 
form a closely interrelated mathematical unit, and an indispensable unit 
in the foundations of circuit theory or any other field of application dealing 
with vibrations or particle dynamics. 

The fifth chapter, on vector analysis, is incorporated at this point 
since the geometrical and algebraic ideas involved are closely: related to 
the foregoing material, and because the two-dimensional aspects of 
vector analysis and field theory are very helpful in lending physical 
clarity to numerous topics in complex function theory which is taken up 
next. Here there is considerably more detail than is to be found in exist- 
ing textbooks for engineers on this subject. While the usual book for 
the most part is content to lay a nominal background for the methods of 
complex integration plus some notions about conformal transformations 
for field-mapping purposes, here it is our aim to provide a physical and 
geometrical feeling for the properties of complex functions adequate to 
meet diverse needs in the network synthesis field. The last two articles 
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in this chapter, for example, are specifically concerned with a detailed 
discussion of factors relevant to stability considerations and the properties 
of physical impedance functions. 

The final chapter, dealing with Fourier series and integrals, stresses 
a number of items with which the engineer is especially concerned such 
as the convergence of Fourier series, the approximation properties of its 
partial sums, singularity functions and their properties, elementary 
transform properties, evaluation of inverse transforms through complex 
integration, and their approximate evaluation through use of the saddle- 
point method. 

THE DEPARTMENT OF ELECTRICAL ENGINEERING 


Cambridge, Massachusetts 
January, 1949 
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CHAPTER I 


Determinants 


1. DEFINITIONS AND USEFUL PROPERTIES 


A discussion of the theory of determinants may be approached in a 
variety of ways. For the reader who already has an acquaintance with 
this subject and can, therefore, dispense with introductory remarks, the 
following procedure* is particularly effective since it strikes directly at 
those ideas which make the determinant a useful tool. 

A determinant is commonly written in the form 


M1 12 Gn 
@q1 G22 °°- 

7 oo i Ren ea i [1] 
Ont Ong *** Gna 


in which the vertical lines enclosing the array of elements a, are intended 
to take the place of parentheses as an indication that these elements are 
the variables of the function A, just as f(x) is written as a symbol for a 
function of x. 

The determinant is said to be of the mth order when it involves 1 rows 
and m columns, the total number of elements then being ”?. The. italic 
capital letter A is used as an abbreviation for the function whose elements 
are denoted by the lower case letter a. Thus, B may represent another 
determinant with the elements b;,, etc. The first index on an element 
indicates the row, the second index the column in which that element is 
situated. 

The determinant may be defined uniquely in terms of the following 
three fundamental properties: : 


I. The value of the function is unchanged if the elements of any row. 
(column) are replaced by the sums of the elements of that row 
(column) and the corresponding ones of another row (column); 
for example, if ay1, 412, °** ain are replaced by (aii + a31), 
(ar + 32), °° (ain + aan). 

Il. The value of the function is multiplied by the constant k if all the 
elements of any row or column are multiplied by k. 

III. The value of the function is unity tf all the elements on the principal 
diagonal, that is, a11, 492,°°* Ann, are Unity and all others are zero. 


*C. Carathéodory, Vorlesungen ixber reelle Funktionen (Leipzig, 1918), Ch. VI. 
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To these three fundamental properties may be added the following 
derived ones: 


IV. The first fundamental property may be amplified to the effect that 
an arbitrary factor times the elements of any row or column may be 
added to (or subtracted from) the corresponding elements of another 
row or column. 

V. The algebraic sign of the function is reversed when any two rows or 
columns are interchanged. 

VI. The value of the function is zero if all the elements of a row or column 
are zero, or if the corresponding elements of any two rows or columns 
are identical or have a common ratio. 


Rule IV may be seen to follow from I and II. As shown in the numerical 
example below, the elements of the third column are first multiplied by &; 
the resulting &-multiplied elements are then added to the respective ones 
of the first column, after which column three is multiplied by k7!, thus 
restoring to its elements their original values. 


i 3° 1 3 2k ({+2k) 3 2k 
AS 4-2 € kA=|4 2 6h |=| (446k) 2 68 
3.17 34) Je (3+7k) 1 7k 
Gi 2ey3 2 
A=| (4+6k) 2 6 [2] 
(34+7k) 1 7 


Rule V is a consequence of I and the extended form IV of II. Thus, 
suppose column 1 is first added to column 3, next the resultant column 3 
is subtracted from column 1, and, finally, this resulting first column is 
added to the resultant column 3. The net effect is to interchange columns 1 
and 3 and prefix all the elements of the first column with minus signs, 
as illustrated below: 


tr 3 3 1 30 
A=|4 2 6/=|4 2 (6+4) 
5.409 3° 1 74-3) 
oS O41) a9 3° 1 
=|-6 2 6+4)|=|-6 2 4 [3] 
3) ~7 1 3 


The first part of rule VI follows from the property II for & = 0, and the 
second part is seen to be true on account of IV because a row or column 
of zeros is obtained when, for a suitably chosen factor, the k-multiplied 
elements of one of the proportional rows or columns are subtracted from 
the respective elements of the other row or column. 
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2. EVALUATION OF NUMERICAL DETERMINANTS 


The properties discussed above may be applied to the numerical 
evaluation of determinants, as is best illustrated by the following numeri- 
cal example. Let 


A= 


1 
4 [4] 
3 


Step /. Subtract from the second row the 4-multiplied elements of the 
first row: 


a ae 
A=|0 -10 -2 [5] 
a ne; 


Step 2. Subtract from the third row the 3-multiplied elements of the 
first row: 


1 *3. 3 
A=|0 -10 -2 [6] 
0 -8 1 


Step 3. Subtract from the third row the ;8,-multiplied elements of the 
second row: 


{Bae 
A=|0 -10 ~2 (7] 
0 oO 18 


Step 4. Subtract from the second column the See elements of 
the first column: 


tf. 0 2 
A=|0 -10 -2 [8] 
0 o 43 


Step 5. Subtract from the third column the 2-multiplied elements of — 
the first column: 


1 0 0 
A=|0 ~10 -2 [9] 
0 o 48 


Step 6. Subtract from the third column the ;2,-multiplied elements of 
the second column: 


[10] 


oOorF 
| 

= 
(=) 

of 

oo on) 
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Application of the fundamental properties IT and TIT then gives 


1 0 0 
A = (1)(—10)8)|0 1 0] = @)(-10)@8) = -26 = [11] 
001 


It is useful to note that the modifications involved in the last three 
steps of this process do not influence the values of the elements on the 
principal diagonal, the product of which is equal to the value of the 
determinant. This fact may be stated in the form of an additional derived 
property: 

VII. The value of the special determinant in triangular form: 


My Go 3 °°" Gin 
0 a22 423 aan 

A=/0 O | 43 -++.d3n [12] 
0 0 O +++ Gunn e 


ts given by the product (a1822°°: ann) of the elements on tis 
principal diagonal. 


With the help of this rule the value of the determinant in the above 
example may be set down after the completion of the third step. 

If in the determinant A the rows and columns are interchanged, the 
values of the elements on the principal diagonal are not affected; and if 
the above operations with respect to rows are then replaced by the same 
operations with respect to corresponding columns and vice versa, the 
same final value is evidently arrived at. This fact demonstrates the 
equivalence of rows and columns as far as the value of a determinant is 
concerned. For convenience in reference this is stated as the property: 


VIII. The value of a determinant is unchanged if its rows are written 
as corresponding columns or vice versa. 


* In numerical work, the method of evaluation illustrated in the above 
example is short and convenient to apply. When an analytic result is 
desired, however, other methods are usually preferable. They are given in 
Arts. 4 and 5, to which the discussion immediately following serves as an 
introduction. 


3. MINORS AND COFACTORS 


If in the determinant A of Eq. 1, one or more rows and a corresponding 
number of columns are deleted, the remaining square array of elements is 
again a determinant. It is referred to as the (m — p)-rowed minor (or 
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minor determinant) of A, where the integer p denotes the number of rows 
or the corresponding number of columns which have been deleted. Thus 
the -rowed minor is the determinant itself. An (2 — 1)-rowed minor is 
also spoken of as a first minor, an (m — 2)-rowed one as a second minor, | 
etc.* 

The minor is customarily denoted by a symbol whose indexes refer to 
the canceled rows and columns. Thus the minor M;, is formed by cancel- 
ing the 7th row and the Ath column in A. It is quite common to speak of 
My, as the minor of a;,, or as the minor corresponding to the element aix, 
although (according to the immediately following discussion) it should 
more properly be referred to as the complement of a,,. 

A minor of second order, denoted by Mx, is formed by canceling the 


ith and rth rows and the Ath and sth columns. The extension of this 
notation to the designation of minors of higher order is readily recognized, 
but when the number of canceled rows and columns is large (cases of this 
sort are infrequent in engineering applications), such notation becomes 
awkward and is usually replaced by some other expedient which seems 
more effective at the moment. 

The elements which lie at the intersections of the canceled rows and 
columns, arranged in a square array in the same order (from left to right 
and from top to bottom) as they appear in the original determinant, form 
another minor determinant N which is called the complement of M. The 
complement of a first minor is a single element; that of a second minor is 
a two-rowed determinant, etc. 

In particular, the minors formed by canceling the same rows as columns 
(these intersect on the principal diagonal) are called principal minors, and 
their complements are again principal minors. 

An alternative view may be taken with regard to the formation of 
minors. Instead of obtaining the minors by canceling rows and columns 
in the original determinant, they may be formed by first selecting from the 
determinant certain rows, and subsequently selecting from this rectangu- 
lar (nonsquare) array any like number of columns. Or, from a given set 
of columns in the determinant A, minors may be formed by the selection 
of corresponding numbers of rows. The minors thus formed are evidently 
the complements of those obtained by the process of canceling the same 
combinations of rows and columns. 

If M is a minor of the determinant A and N is its complement, and if 


the rows and columns coniained in M are formed from thei, 7, k, +--+ th 
rows and the gq, 7, s, --- th columns of A, then 
Nx (- i ce a aia a Be a aa [13] 


99 66 


*The terms “minor of first order,” “second order,” etc., are also used. 
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is referred to as the algebraic complement of M. This differs from the 
ordinary complement only in its algebraic sign. If the sum of the indexes 
referring to the (first, second, etc.) rows and columns of A contained in M 
is an even integer, the algebraic sign, by Eq. 13, is +1; if this sum is an 
odd integer, the sign is —1. 

The relation between a minor and its complement is evidently a mutual 
one in the sense that the two designations may be interchanged. Whereas 
M may be called a minor and W its complement, N may be looked upon 
as the minor and M as its complement. Thus, a single element may be 
thotight of as a one-rowed minor. If the element is @,,, its complement is 
the minor Mz. 

The algebraic complement of the single element a,, is sufficiently 
important to deserve a special name and symbol. It is called the cofactor 
of a, and is quite commonly denoted by the corresponding upper-case 
letter with like subscripts (although various other notations are also 
encountered in the literature). Thus, the cofactor of a, is given by 


Ag = (-1)'"* My [14] 


It differs from the minor (which is the complement of a;,) only in algebraic 
sign; hence the cofactor is sometimes referred to as the signed minor. 

The indexes 7 and k, whose integer values determine the sign-controlling 
factor (—1)***, refer respectively to the row and column intersecting at 
the point where the element a, is located. If the cofactor is formed for an 
element in the original determinant, its indexes and those appearing in 
the sign-controlling factor obviously agree with the indexes appearing 
on the element in question, because the indexes on an element of the 
original determinant indicate respectively its row and its column positions. 
This correspondence is, however, no longer consistently true in a minor 
of the original determinant. 

For example, the minor Me3 of A, Eq. 1, reads . 


441 G2 414 G15 *** Ain 
@31 432 434 435 °** Ggn 

Moz = | G41 Gag O44 Ogg +++ Gan [15] 
Qni1 Gna Gna Ons ann 


Here the element 32, for instance, is located at the intersection of the 
second row and the second column. This is referred to as the (2,2) position 
in the minor determinant of Eq. 15. In general, the term (7,5) position is 
used to indicate the location at which the rth row and sth column of a 
given rectangular array intersect. The object of the present argument is 
to point out as a typical case that in forming the cofactor for the element 
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zo for the minor determinant of Eq. 15, the sign-controlling factor is 
(—1)?+? and not (—1)?*?. 

If only the algebraic signs of the cofactors are set down at the positions 
of the corresponding elements in a rectangular array, the following 
checkerboard of + and — signs is obtained: 


ey ES es 
Qo See 
a He Galt [16] 
ede aes 


This pictorial statement for the signs of the cofactors is sometimes referred 
to as the “checkerboard rule.” 


4, LAPLACE’S DEVELOPMENT 


In the manipulation of determinants, and sometimes to facilitate their 
numerical evaluation, a process of development formulated by Laplace 
is frequently useful. It may be stated in the following form: 


If all the minors are formed from a selected set of rows or columns of a 
determinant and the products of these minors with their respective al- 
gebraic complements are added, the resulting sum is equal to the deter- 
minant. 


If a single row is selected, this development reads 
A= adn tdidigt+:-+indin G=1, 2, +++) [17] 
For a single column, the result is written 
A = aA, + d¢Aoe +--+ OntAnk (R= 1, 2, ---+m) [18] 


In Eq. 17 the determinant is represented by the sum of the products of 
the elements of any row with their respective cofactors. In Eq. 18 a 
corresponding summation is carried out with respect to the elements and 
cofactors of any column. This simplest form for the Laplace development, 
which is also called an expansion of the determinant along one of its rows 
or columns, is the one most frequently used. 

It may be of interest, however, to illustrate a more complicated example 
of this type of development. Let the following fourth-order determinant 


1 G2 3 G14 
a a a a24 
Ave 21 22 423 [19] 
a3, G3q 33 434 
Q4, qq 43 O44 
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tion of minors. All possible two-rowed minors are systematically formed 
as the rows: 1, 2; 1, 3; 1, 4; 2, 3; 2, 4; 3, 4 are selected from these columns. 
The sign-controlling factors of the corresponding algebraic complements 
are respectively: 


(-1)He = +1 
(—1)124148 = +1 
(—1) 4 _ +1 
(—1)! 424248 oem | 
(—1) eee ee | 
(-1)? he = +1 


With the terms written down in this order, the development reads 


41 42 x %3 34 | | G11 ie x 423 G24 
421 422 G43 44 a31 32 443 a4 
a1, 2 a23 24 G21 42 a3 a4 
+ x + x . 
G41 42 433 34 a31 432 a43 a4 
G21 422 413 4414 431 32 @3 WN4 
= x x [20] 
a41 a2 433 434 a4, G42 423 24 


By means of the Laplace development a determinant may evidently be 
evaluated in a variety of ways. One possible method of evaluation consists 
in repeatedly applying the simplest form of development given by Eggs. 17 
and 18. In the first step of this process, the development is given by the 
sum of 1 terms, each of which is the product of an element and an (n — 1)- 
rowed cofactor. In the second step, each of these cofactors is similarly 
developed, thus yielding for the determinant A a sum of n(n — 1) terms, 
each of which consists of the product of two elements and an (m — 2)- 
rowed cofactor. As the process is continued one recognizes that the final 
evaluation of A is given by the sum of 7! terms, each of which consists of 
the product of ” elements. 

The determinant is, therefore, a rational integral function, homo- 
geneous, and of the mth degree in its elements. In any term of the final 
evaluated form, the appearance of the product of an element with another 
element of the same row or column is not possible. This fact is readily 
appreciated by noting in the term @12A1o, for example, that the cofactor 
Aj. contains no elements of the first row or second column. Hence none 
of these elements can subsequently appear in a term containing a). The 
determinant is, therefore, a linear function in the elements of any one row 
or column, 
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5. OTHER METHODS OF EVALUATION IN NUMERICAL OR FUNCTIONAL 
FORM 


The evaluation of a determinant by means of the Laplace develop- 
ment, although useful for numerous analytic investigations, is a long and 
tedious process. The solution of simultaneous linear equations by means 
of determinants, as discussed in Art. 8, is usually found in numerical 
problems to involve a larger number of component operations than a 
systematic process of elimination. This situation is true even when the 
determinant and cofactors are evaluated by the method given in Art. 2, . 
although this méthod parallels the elimination process in its essential 
steps. 

Alternative abbreviated methods of solving such equations are given 
in Arts. 7 and 11 of Ch. If. From a broader standpoint it is well to be 
familiar with numerous processes of evaluating determinants, so that the 
particular conditions of a specific problem may be met most expeditiously. 
In this regard the following remarks may also prove useful. 

Evaluations of two of the simplest cases by means of the Laplace 
development method are written down so that their resultant forms may 
be examined. 


1 2 


411222 — 21412 [21] 
G21 92 


Qi G2 13 

ay tae ae 411499033 + 12423431 + 13021432 [22] 
— 31422013 — A32023011 — 433021412 

a31 432 433 : 


By inspection of Eq. 21 it may be said that the value of a two-rowed. 
determinant is given by the product of the elements on the principal 
diagonal less the product of the elements on the conjugate diagonal 
(lower left to upper right) as indicated in the following by arrows: 


principal >, _7™ negative product 

diagonal a 92 

4 [23] 
oer UN 

conjugate! “az 

diagonal “at-positive product 


This is called the diagonal product rule. It is applicable in extended form 
to the evaluation of a three-rowed determinant. Here there are three 
positive and three negative products, the positive ones being formed by 
elements on the principal and adjacent parallel diagonals and the negative 
ones by elements on the conjugate and adjacent parallel diagonals in a 
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manner which is more easily understood if the first two columns are 
repeated so that the arrows may continue straight, thus: 


negative products 
Fi 7 Ka 


eo ee ree 
5 
@i “Ayn Fig| A@u 12 | 
a uN oa s a : 
Gn Oi 023 | Peay Gan | repeated columns [24] 
7 ! ~ 
; 


ae GOR G \% ' 
LQ31 32 Joy! O31 G32. | 
positive products 


The result is seen to check with Eq. 22. 

An extension of this rule does not yield the value of fourth and higher 
order determinants, as may readily be appreciated from the fact that the 
number of terms in the final evaluation must be 7!, whereas the diagonal 

_ product rule yields only 2” terms. If nm = 4, there remain. 4! — 2 x 4 = 16 
terms unaccounted for after all diagonal products have been formed. 

From a more comprehensive study of determinants, it is seen that all 
the terms in the final evaluation may be found by writing down the group 
of elements on the principal diagonal 


411. Gq O33 °** Ann 


and carrying out all permutations of the first subscripts, keeping the 
second subscripts fixed, or vice versa. In either case there are as many 
different products as there are permutations of 7 things taken 7 at a time, 
which is x! 

In this process, the algebraic signs of the various terms are controlled 
’ by the rule that all permutations formed by an even number of inversions 
of the permuted subscripts represent positive terms, all others being 
negative. Thus for = 4 the possible permutations are 


Even number of inversions Odd number of inversions 
23 4 3 


RP PR Pw WW © Re be 
WHR SP Pe KH RH |! WD 
DHPWNHwhRENEE 
RPwWNMOQF WRN PD 
BP PR DR WH & W eH bs be 
BOWE PRP DY PD KH WW 
WrRHWFE PNF PD 
BN WNW We PD Pe PS 
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The twenty-four terms written with these as the first or second set of 
indexes, the fixed set being 1 2 3 4, and prefixed with algebraic signs 
according to the stated rule, represent the evaluation of the fourth-order 
determinant. 


6. BORDERED DETERMINANTS 


A determinant of given order may be transformed into a determinant 
of higher order without changing its value, as is readily seen by applying 
the ideas of the Laplace development to the following example: 


0 1 Br Be [25] 


a1 2 
421 422 


The elements ao, 01, a2, 81, B2 may have any values. The process can 
evidently be varied by placing the zeros to the right or above or below the 
rectangle containing the a;,’s. The resulting form is referred to as a 
bordered determinant. 


7. PRODUCTS OF DETERMINANTS 


The product of two determinants of like order can be expressed as a 
single determinant of the same order. If the two determinants are initially 
not of the same order, one of them can be bordered. In the present discus- 
sion the determinants can, therefore, be assumed to have the same order. 

The procedure for obtaining the elements of the product determinant 
is best illustrated by means of a simple example. By the Laplace develop- 
ment the transformation of the following product is justified: 


a1 2 —1 0 

_ | G1 ee 0 —1 

ig. O° oa. baa | Ue 
0 


0 bor ae 


According to rule IV, Art. 1, the resultant fourth-order determinant may 
be modified in the following manner without changing its value. First 
the 6;;-multiplied elements of the first row and the 6)2-multiplied ele- 
ments of the second row are added to the corresponding elements of the 
third row, giving 


bi bie 
ber bee 


Q@1- G2 
ao, deo 


AXB= 


a1 G2 —1 0 
_ | @1 422 0 —1 
BES Ci. Can 0 0 [27] 
0 0 ba bos 
in which 
C11 = 41011 + Aeibye . 
C21 = 2611 + Goabi2 [28] 
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The object of this transformation is to produce zeros in place of the ele- 
ments },; and 6, in the fourth-order determinant of Eq. 26. Now both 
the b2,-multiplied elements of the first row and the bo2-multiplied elements 
of the second row are added to the corresponding elements of the fourth 
row, giving ; 


a, ae —1 0 | 
A x B a @o1 422 0 -1 [29] 
C11 C21 0 0 
Cig C22 0 0 
where 
Cig = A ba, + Gorde0 [30] 


Cog = Gabe, + d2abo0 
By the method of Laplace’s development the determinant of Eq. 29 is 
simply 


C11 «612 
C21 C29 


C11 C21 
C12 C22 


AXB= 


[31] 


the second form being obtained by an interchange of rows and columns. 
Examining the expressions for the elements c,,, as given by Eqs. 28 and 
30, it is observed that they are formed by multiplying successive elements 
in the columns of the determinant A by successive elements in the rows of 
the determinant B and adding the results, the specific columns and rows 
involved being indicated by the first and second subscripts respectively 
on G;z. Thus ¢; is formed from the elements of the first column of A and 
those of the first row of B; cyz is formed from the elements of the first 
column of A and those of the second row of B, etc. More briefly, the c’s 
are said to be formed by multiplying the columns of A by the rows of B. 

Since the individual values of the determinants A and B are unchanged 
by writing their rows as columns, it is clear that the value of the product 
determinant is unaltered if its elements are formed by multiplying either 
the rows or columns of A by either the rows or columns of B. The elements 
of this resulting determinant may, therefore, be formed in any of four 
different ways. Although the individual elements thus obtained are 
different, the value of the resultant determinant remains the same. 

A straightforward extension of the method used in the above example 
shows that the process of forming the elements of a determinant repre- 
senting the product of two given determinants A and B follows the same 
general rules when A and B have any order. This result is summarized 
in the statement 


[32] 


eee enenseene | SM | ee eee eee fp te eee ee ee 
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with any one of the following four relations: 


Cig = Qiyber + Gigdea + +++ + Ginden [33] 
Cz = Oi1by_ + Aigbox + +++ + indus [34] 
Cie = Orsder + daideg + +++ + Gniden [35] 
Cin = Aide + daidon + +++ + nidne [36] 


On the right-hand sides of the last four expressions, the indexes 7 and 
k have fixed values for any one of the c’s. Since the first index refers 
to a row and the second to a column, it is recognized that the four repre- 
sentations for cj, above correspond respectively to multiplying rows of A 
by rows of B, rows of A by columns of B, columns of A by rows of B, and 
columns of A by columns of B. 


8. LinEAR EQUATIONS 


One of the most important uses for determinants is in the solution of 
linear simultaneous equations. A set of such equations involving ” 
unknowns reads 


041%, + Qye%q tess + inka = V1 
G21% + AgeX%e +--+ + denkn = Yo 37] 
OniX1 + GnoXe2 +--+ + GnnXn = Va ; 


The object is to determine the values x1 ---%, in terms of the values 
y1°**¥n and the coefficients a; which:are the elements of the determinant 
of this system of equations. This determinant is given by Eq. 1. 

Let the equations in the set’ (Eq. 37) be multiplied successively by 
the cofactors Ay, A421, *::+Ani- Subsequent addition yields the 
equation 


(041A 41-4214 91+ +++ +Gn1An1)*1 
+ (a42A 41 +@02A 91+ +++ +4n2A ni) X2 
+ (013411 +4234 01 + +++ +4ngA 1) 2 


Pe ee Dd 


+ (d1nA11+G2nAoit +: +dnnAnt)tn=Ary1tAavet-:+Aniyn [38] 


Here the coefficient of x; is recognized as the Laplace development of the 
determinant A with respect to the elements of the first column, as given 
by Eq. 18 for k = 1. Similarly, the coefficient of % is seen to be the 
Laplace development, with respect to the elements of the first column, of 
a determinant which is formed from A by replacing the elements of the 
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first column by those of the second column. This coefficient, therefore, 
is the determinant 


G2 G2 413 : An 
G22 e223 : den 


[39] 


Gn2 Ang nz : Onn 


which by rule VI, Art. 1, has the value zero. 

Similarly, the coefficient of xg is equal to the determinant A with its 
first column replaced by the third column. This is likewise zero, as are the 
coefficients of all the remaining x’s. Equation 38 is, therefore, equivalent 
to 


Ax = Ayyyy + Aarye +°++ + Ann [40] 


whence 


A A ies An 
As 1191 + ns + Ann [41] 


Tn like manner the solution for x2 may be obtained by multiplying the 
equations in the set 37 by the cofactors Ai2, Ago, -- + Ang, respectively 
and adding the results. The coefficient of x, then equals A, and the 
remaining ones are zero. Hence there results 


gos Ae + feet eohee & An2Vn [42] 

This result may be stated in general terms by assuming Eqs. 37 to be 
multiplied respectively by the cofactors Aix, Aoz, +++ Anz and adding the 
results. The coefficient of x, then equals A, and the remaining ones are 
zero, so that 


A A hat Ae Aesgaye 
wp = NE Aaa Ps + 2a — ——_ [43] 


For k = 1, 2, +--+, this represents the desired solutions. 

The numerator of Eq. 43 is recognized as the Laplace development of a 
determinant which is formed from A by replacing its kth column by the 
column of y’s appearing on the right-hand sides of Eqs. 37. Thus the 
result, Eq. 43, may be written 
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yy 2 Qye1 V1 Get 2 Gin 
@21 : dok-1 Vo Ga,k+1 : Gen 


Gnit : Onje-1 Yn On,k+i : Ann 


wy, = [44] 
G1 G2 °** Gn 
421 22 Gen 
Qn1 Gng°*** Ann 


A statement describing this form of the solution is known as Cramer’s 
rule. 

A significant feature in the derivation of these solutions is a recognition 
of the validity of the relation 


A j= 
QyiAry + Gg:Aoy +++? + aniAng = ‘6 oe 


[45] 
which justifies the step from Eq. 38 to Eq. 40. The companion relation, 
which is established in an analogous fashion, reads 


A fori=k 


0 fori e [46] 


QiApy + di2Ane + +++ + GinAnn = { 

For  k this represents the Laplace development of a determinant whose 

ith and kth rows are identical. Equations 45 and 46 may he looked upon 

as an extension of the relations expressed by Eqs. 18 and 17 respectively. 
The solutions to the set of Eqs. 37 may be written in the form 


biiy1 + b2¥2 +++ + Onn = %1 
bart + daove + +++ + bendn = %2 [47] 
bni¥1 + bn2ve +: - + Ban¥n = Xn 
in which, according to Eq. 43, the coefficients are given by 
A, 
brs = ae [48] 


In this result it is significant to note the reversal of the subscripts on A.,, 
as compared with those on },.. 
In tase the elements of the determinant A fulfill the condition 
Qin = Ai [49] 


the determinant is said to be symmetrical about its principal diagonal. 
It is clear from rule VIII, Art. 2, that the minors and cofactors of A then 


16 DETERMINANTS (Ch. I 


also have this property, that is, 
Aun = Ax [50] 


and it then follows from Eq. 48 that the determinant of the system of 
Eqs. 47 is likewise symmetrical. In that case the subscript order in Eq. 48 
is, of course, unimportant. , 

Equations 37 and 47 are mutually inverse systems. The one set repre- 
sents the solution of the other. Consequently by analogy to Eq. 48 the 
coefficients of Eqs. 37 may be written 


Bui 
ak = B [51] 
in which. 
bit " bin 
Bow] eee raceee [52] 
bat * ban 


is the determinant of the system of Eqs. 47 and B,, its cofactors. 
Evaluating the product AB of the determinants of the inverse systems 
of Eqs. 37 and 47 by substituting Eq. 48 into ees 34 gives 


x ene + ajgAne ++++ + dinAkn 


Cx = A [53] 
Reference to Eq. 46 shows that — 
1 fort=k 
aS {3 for i # k [s4] 
Hence for the determinants of these inverse systems 
bir +++ Bin M1 °° Un 1 0---0 
ein inathe 34 Sle ead. |e) Oo Tee [eed ° 185) 
bai : Onn Qn °** Ann 0 0 -1 


The determinants have inverse values. 


9. CONDITIONS FOR. THE EXISTENCE OF SOLUTIONS 


The conditions under which a system of simultaneous equations such as 
the set 37 can have solutions may be seen from the form of these solutions 
as expressed by Eq. 44. For arbitrary values of 1 - + + yn, a necessary and 
sufficient condition for the existence of these solutions is that the deter- 
minant A shall not be zero. If A is zero, in general no solutions exist. 

They may, however, still exist in case the determinant in the numerator 
of Eq. 44 is also zero, as it is if the y’s satisfy the conditions 


Ye = OyQy + agQig +--+ + andin - [56] 
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in which a1, +--+ @, are arbitrary factors. The column of y’s in the nu- 
merator of Eq. 44 is then a linear combination of the other columns of this 
determinant and can, by repeated application of rule IV, Art. 1, be re- 
duced to a column of zeros. 

When the y’s are expressed by the relations 56, the Eqs. 37 can be 
rewritten in the form 


41 (X14 — 04) + G12 (%2 — ae) +++ + @in(%n — an) 
a1 (%1 — 01) + Goa (%e — a) + +++ + Gon(%n — an) 


Ons (41 — 1) + dng (%2 — arg) +s + nn Gn — Qn) =0 


A special case of this sort occurs when all the ’s are zero. Then a, = 
ag = +++ =a, = 0, and Eqs. 57 become identical with Eqs. 37 for 
V1. = Yo = +++ = In = 0. This is called the corresponding homogeneous 
set of equations. For these, the Cramer rule as expressed by Eq. 44 yields 
the solutions in indeterminate form except when A = 0, but then the 
solutions are all zero. They are spoken of as the trivial solutions because 
their existence is at once evident upon inspection of the homogeneous 
equations. 

Nontrivial solutions to the homogeneous set of Eqs. 57 exist only if the 
determinant is zero, but Cramer’s rule, Eq. 44, is of no use in determining 
them. In order to see how this difficulty might be overcome, it is helpful 
to consider the Eqs. 37 for the special case that one of the y’s, for example, 
y;, alone is different from zero. Then, if it is assumed for the moment that 
the determinant is not zero and Cramer’s rule or Eq. 43 is applied, it is 
found that 

A iKyi 


a [58] 


In this special case the ratio of any two unknowns is given by 


Br Ary 

Xe Aig [59] 
which is independent of the values of both y; and the determinant A. 
It may be inferred, therefore, that Eq. 59 holds also when both y; and 
A are zero. : 

The correctness of this conclusion is demonstrated in a rigorous fashion 
in Art. 7, Ch. III. Meanwhile it is interesting to note that when the 
homogeneous set of equations has nontrivial solutions, these are not 
uniquely determined by Eq. 59, in which only the ratios of the unknowns 
are given. Any value can be assigned to one of them, and the remaining 
unknowns are then expressed in terms of this one. 
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The general conditions for the existence of solutions may be discussed 
as follows. The fact that the inhomogeneous equations 37 have solutions 
only when the determinant is not zero simply amounts to stating that 
these equations must be independent. If one is a linear combination of 
the others (in this case the determinant vanishes), then, speaking in 
physical terms, the data are insufficient to yield an explicit answer. 

In case the right-hand members of the Eqs. 37 are zero and all the equa- 
tions are independent (A # 0), the system is overspecified from a physical 
point of view. The situation is like a deadlock, and nothing can happen; 
that is, only zero values for the unknowns can satisfy the equations. If 
one of the equations is a linear combination of the others (A = 0), this 
one may be discarded and one of the terms in each remaining equation, 
for example, that with x,, transposed to the right-hand side. These 
(nw — 1) equations may then be solved for the (m — 1) remaining un- 
knowns in terms of x,, provided the determinant of this reduced set. is 
not zero. If it is zero, this method fails, but so does the corresponding 
form of solution expressed by Eq. 59. 

This kind of failure in the method of solution indicates that two inde- 
pendent sets of solutions exist, but it is difficult to obtain a clear 
picture of this situation without the aid of such appropriate geometrical 
interpretations as are given in Ch. IIT. The present discussion is completed 
in that chapter. The material of the following article, however, is helpful 
in summarizing some of the characteristics of the determinant which are 
pertinent to the present problem. 


10. THE RANK OF A DETERMINANT 


If in the determinant A, Eq. 1, there exists among the elements of each 
column the same linear relation 


001% + A2dex + °°: + andnk = 0 gy a 1, 2, Peet n) [60] 


in which the a’s are arbitrary nonzero factors, the elements of any row 
are expressible as linear combinations of the corresponding elements of the 
remaining rows. If some of the factors a1--+ a» are zero, this fact still 
holds for the elements of some of the rows. By repeated modification of the 
determinant according to rule IV, Art. 1, any one of these rows can be 
reduced to a row of zeros. Hence it is seen that a determinant is zero if a 
relation of the form given by Eq. 60 exists in which at least one of the a’s 
is different from zero. 

Conversely, if the determinant is known to be zero, it is surely possible 
to find a relation of the form of Eq. 60, as is clear if Eq. 60 is written out 
for all the k-values, thus 
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@y101 + dgiag +++ + dnian = 0 
@y201 + Ageaq + °++ + near = 0 : [61] 
QnA + Gene tte + Annan = 0 


This is simply the homogeneous system of equations in which the deter- 
minant has its columns written as rows. It is called the transposed set of 
equations. According to property VIII, Art. 2, the transposed determinant 
is zero if the given one is. Hence, by the method discussed in the previous 
article, a set of nonzero a-values can be found to satisfy Eqs. 61. 

It is then also possible to find a set of nonzero 8-values satisfying the 
relations 


Bidiy + Bodig +--+ + Brdin = 0 (i =1,2,---n) [62] 


since they are simply a set of solutions to the untransposed system of 
homogeneous equations. 

It follows that if the elements in a row of a determinant are expressible 
as linear combinations of the corresponding elements of the remaining 
rows, the elements in a column of this determinant are similarly ex- 
pressible. 

When a determinant of order 7 is not zero, so that neither the relations 
60 nor 62 exist (except for zero a- and £-values), it is said to have the rank 
y = n. If the determinant is zero but at least one of its (7 — 1)-rowed 
minors is different from zero, one set of a- and #-values exist which 
satisfy the Eqs. 60 and 62. The determinant is then said to have the 
rank ¢ = » — 1. If the determinant and all its (n — 1)-rowed minors are 
zero but at least one of its (n — 2)-rowed minors is different from zero, it 
has the rank r = » — 2. In this case it is possible to find ¢wo independent 
sets of a- and §-values satisfying Eqs. 60 and 62, and the homogeneous 
system of equations has two.independent sets of solutions, as is discussed 

_further in Ch. III. Corresponding statements for the condition that the 
rank is n — 3, n — 4, etc., are obvious. Finally, if the rank of a deter- 
minant is zero, all its elements are zero. 

The definition of rank facilitates the discussion of possible solutions to 
a system of linear equations. Thus the nonhomogeneous Eqs. 37 have 
solutions only when the rank of their determinant is 7. The corresponding 
homogeneous equations have nontrivial solutions only if the rank is less 
than , and they have p sets of independent solutions if the rank ism — p. 
A more lucid exposition of this line of reasoning is found in the chapters 
immediately following. 
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PROBLEMS 
1. Determine the rank of each of the following determinants: 
(a)|1 2 3 4/()| 3 9 20 18]/@)| 5 OO -2 9 
4 6 


2 8 8 19 40 37 G {Seti oF 
36 9 12 13 20 47 34 <2 mit. (28 219 
4 8 12 16 20 22 59 31 9 7 -19 26 
(4)| 12 -5 8] ()|39 24 12 5] (165 65 4 
4 0 1 a 2 2-2 65 17 8 
Shs gre 12 2 10 3 4. 8 4 
598. 3 4 


2. Transform each of the above determinants to the triangular form, thus finding 
their values and checking the answers to Prob. 1. 

3. For each determinant in Prob. 1 whose rank is less than its order, find relations 
of the form of Eqs. 60 and 62. 

4. Using determinants (e) and (f) of Prob. 1, write down corresponding sets of 
simultaneous equations, denoting the right-hand members by 41, yo, - + as in Eqs. 37. 
Solve these equations by means of Cramer’s rule. 

5. Repeat the solutions of the equations in Prob. 4 by means of a systematic 
elimination process. Compare the total number of multiplications and additions with 
those required in the solutions using Cramer’s rule. 

6. Evaluate the following determinant according to the pattern shown in Eq. 20. 


2 14 3 
6 -1 2 —4 
3-2 5 1 
-5 64 -1 


Repeat the evaluation through reduction to the triangular form and compare the 
total numbers of multiplications and additions required in the two methods. Derive a 
formula giving the total numbers of multiplications and of additions required for the 
evaluation of an xth order determinant by the method involving its reduction to the 
triangular form. 


7. Determine the solutions to a set of equations (like Eqs. 37) having the ac- 
0.5 0.5 05 05 
—0.866 0.289 0.289 0.289 


0 —0.258 0.408 0.408 
0 0 —0.707 0.707 


companying determinant. Compare the set of equations representing the solutions 
with the given equations and note any obvious mutual relations existing between 
these two sets of equations. 

8. Given the two sets of equations 


n 
Liste =y (= 1,2,-++n) 
k=1 

and 


n 
Dy bua = He (k = 1,2,---) 
s= : 
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which are solutions of each other, show that the corresponding determinants A and B 
have inverse values; that is, AB = 1. A proof may be based upon the rule for forming 
the product of two determinants. : 


9. In the following special nth order determinant 


Dz, =)0 1 @ ieee 


ey 


all the elements of the principal diagonal are equal to a; those on the diagonals im- 
mediately above and below the principal diagonal are unity, and all the rest are zero. 
Derive the following recursion formula: 


Dn = aDn—1 _ Dn—2 


applicable for. = 1, 2, --- with the definitions: Dy = 1 and Di =a. From this 
recursion formula obtain an explicit expression for the determinant of order ” which 
reads 


_ sinh (n + 1)7 : 
mre sinh 
10. If the first and the last elements on the principal diagonal of the determinant 
in Prob. 9 are replaced by a/2, show that the resulting determinant has the value 
D, = sinh (2 — 1)y° sinhy 
while if only the first or last of these elements is a/2, the value is given by ~ 


withy = cosh“! 


D, = cosh ny 
11. Consider the determinant 
dy 1 0 0 O O-::- 
-1 do 1 0 0 O--- 
D= 0 -41 d3 1 O O---| = K(di,°°*dy) 
0 O -1 ds 1 O-- . 


eee emer eer eee e er eserecer ese 


and show that this function K(di,--+ dn), called a simple continuant, possesses the 
recursion formula 


Kd, pea dp) os d,K (di, oe da-1) + K(d, +++ dno) 
with 
K(d;) = d, and K(Q) =1 
12. Denote by D4: the cofactor formed through canceling the first row and column 
in the determinant given in Prob. 11. Make use of the results of Prob. 11 to show that 
D _ K(d1,-++dn) igre oer © 
Soe SE ae dy eet eet et 
Dy K(d2,--- dn) a Vide ide hia 
which is known as a continued fraction. 
13. Show that 


+ere 


K(ds, +++ dn) = K(dn, +++ 1) 
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and that the recursion formula given in Prob. 11 may alternatively be written 
K(dy,+++ dn) = d1K(do,--- dn) + K(ds,°-- ds) 


14, Show that the partial derivative of a determinant with respect to one of its 
elements equals the cofactor of that element. In symbols: 0A/Oasx = Ack. 

15. Consider the cofactors A, and Ae, corresponding to the elements a,, and dar 
in the same row of a determinant A. Show that the sum (Ag, + Asr) is equal to 
A,x with the column involving the elements a,, replaced by one with elements 
sr + dgx(— 1)", or to Ag, with the column involving the elements a,, replaced by 
one in which the elements are asx + dgr( — 1)". 

16. Using the type of reasoning involved in the previous problem, show that the 
fourth-order determinant may be written as the following sum of two third-order 
determinants: 


dei 22 (ai3d24 — 434023) 
G31 G32 (13034 — 414033) 
G41 G4 (Ai3044 — C1443) 


(411022 — 412021) 423 de4 
(431432 — @32031) @33 G34 
(G@1142 — @32041) 443 G44 


+ 


or as a variety of obvious modifications of these forms. 
17. Express in determinant form the condition that the three straight lines defined 
by 
ax + diy + dis = 0 ' 
aoix + aa2y + d23 = 0 
dgix + azey + a33 = 0 


shall intersect at a common point. 
18. In the XY-plane the origin O, and the two points P(x1,yi) and Q(x2,¥2) 

determine a triangle. Show that the area of this triangle is expressible by means of the 

determinant 

1 

2 


41 41 
x2 Ye 


19. Using the result of the previous problem show ‘that the area of a triangle 
determined by the points (%1,y1), (%2,ye) and (x3,¥s) is expressible as 


1 xy Yi 1 
2 x2 Ye 1 
x3 yg 1 


Write the condition for which these three points lie on the same straight line. 


20. ax + by + cz +d =0 is the equation of a plane. Its intercepts on the co- 
ordinate axes are: x = —d/a, y = —d/b, 2 = —d/c. Let m denote the normal from 
the origin to the plane. Its direction cosines are: | 


ae ees or eee 
-@) -G) 


Since the sum of the squares of these cosines equals unity, one has 
ie ieee, 
Ve+R +2 


n 
cos (1,2) = — 


=G) 


cos (n,x%) =- 
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Consider a point Po(x0,yo,20) for which axo + byo + c%o +d = Do. Subtracting this 
equation from the original one gives a(x — xo) + b(y — yo) +(e — 20) + Do = 9, 
from which it is clear that the length of the normal dropped from the point Pp to the 
plane is 


Do 
a/ a? + b? + c2 
These results and that of the previous problem are to be made use of to show: (a) 


That the equation of a plane passing through the three points P(x1,91,81); 
Po(22,y2,8e) and Ps(xs,y3,23) may be written in the form 


no = 


x y @ 1 

x 1 
D(x,y,2) = ps s zq 1 =0 

x3 ys 23 1 


(b) That the cofactors of the first three elements of the first row, that is, 


yf 1 1 4% 1 v1 yi 1 
Yo 22 1|> Zo Xe 1|> x2 Ye2 1 
Y3 33 1 23 X3 1 %3 Y3 1 


are equal to the projections of the area of the triangle P,P2P3 upon planes normal to 
the X-, Y-, Z-axes respectively and that the square root of the sum of the squares of 
these cofactors equals the area of this triangle. (c) That Do = D(xo,yo,20) divided by 
this square root equals the normal distance of a point Po(x0,¥o,20) from the plane, and 
hence that the volume of the tetrahedron whose vertexes are the points PoPiP2F> is 
equal to one-sixth the value of the determinant Do. 


21. Three planes passing through the origin are represented by the equations 
ax + aey + a3 = 0 
bix + bey + baz = 0 
ou + coy +632 = 0 


Express in determinant form the condition for which these planes intersect in the 
same straight line and find the expressions for the direction cosines of this line. Given 
any two planes, what are the direction cosines of their intersection? 


22. Write the following determinant as a polynomial in A: 


@1—-X az 413 
PQ) =| aa1 G22 —D a3 
31 a32 a33 — 


and obtain expressions for the coefficients of the polynomial in terms of the determi- 
nant A and its cofactors. Indicate the forms of these expressions for an nth degree 


polynomial. 
23. Prove that 
(a) (11 + b11) €12 °** Cin a1 C12 *°*°* Cin bir Cia °°? Cin 
(de1 + be1) Cag *** Can} _ | dar Can s*> Con a bar Ca2 °°" Con 
(@n1 + Bn1) Cn2°°* Cnn GQn1 Cn2°°°* Cnn bni na *** Cnn 


(b) If Ce =Ox+jbx 4,8 =1,2,3 g=V—-t 
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then 
411 G12 dis Qu bie dig bir ie O18 bir bye ais 
\cix| = | G21 Gee Gag) —| Ger bez bes | —| bar Gee bes) —| bar bea aes 
a3, a3q 33 31 32 bsg bs1 age 533 Bsr bse 33 
4141 ai b13 @11 big aig bir dig 413 bir bie Bag 
+j4| Ge1 deg bes | +] aor baz aes} +] ber 22 Ges | —} bar Dee des 
43, G3 533 a3, bs dag b31 G32 33 bar B32 ba 
(c) If dix = ak — jbix 1, k= 1, 2, 3 ‘ 
then 
\du| = [cx] complex conjugate of |cix| 
24, Let 


Cue = On +7by, 1,k =1,2,+--+n j=zV-i1 


Prove that |ci,| is a complex number with the following law of formation: 
. (a) |cu| is equal to. the sum of 2” determinants of order , 2*~) being real and 
2-1 pure imaginary. 

(b) The real determinants have an even, or zero, number of “b” columns. The 
imaginary ones have an odd number of “ b ” columns. 

(c) If m is the number of “ b” columns in any determinant in the expansion, the 
sign and complex character of the determinant are given by j”. For a given value of m, 
there are several determinants which contain m “b ” columns; their number is given 
by 


Re ee ee ee 
m} 


(d) By using the above properties show incidentally that 


oi ~1)(n —2 
ton 4 2ee yeas ue deep Biomn 


25. If vie = yu(t), for i,k =1,2,-+-m, are mn? single-valued, differentiable 
functions of the independent variable 7, show that 


d va yiet** Yin Yur gaat? Din Vii yaa tts yan 
q butl = Yar Years Yan), | Yar Yaz se* any. 4 Yor Yoo Von 


peer eee reser e | Tf eee eer ene seree J © "| evar evevr stores 


yan ye 0 Win Yur 32 "7" Yin yu Vie 8! Van 
— |321 Yaz °° * Yon 4 [Fa Yaarrs Van Lutter Yor Yar °° Yon 


es a ns ie ee a cay 


Yni Yn Ynn Yn1 Yn2 °°° Yan Yul Yu2*** Yan 
in which 
dys 
oa = i 


26. In terms of the 7 independent functions 
ye = yu(t) fork =1,2,---n 
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(differentiable up to and including the th order), construct the following determinant 
(the so-called Wronskian of those functions): ; 


5] 7 ne ne? 
Fa SNE Mee eT a 
Aalto ya on Ve = Te) 


yD ygm—Dyg(M—D 2s y, (9D 


Tf these functions are connected by a linear relation of the form 
Ay + Aye + Asys +°* + Andn = 9 


in which the A’s are constants, show that the above determinant vanishes identically. 
Hint. Differentiate the linear relation successively 7 — 1 times so as to obtain 


Ay’ +e0 + Any’n = 


Pe ee 


Ay) feee be Anya) =0 


Together with the original relation, one then has a set of m equations. From these, the 
value of any function y;, for example, 1, and its first (n — 1) derivatives can be 
obtained, Substitution into the Wronskian, followed by an expansion according to 
columns, leads to the desired result. 


27. Using the determinant, A, as defined in Prob. 26, show that 
1 y2 ¥3 88 Vn 
d , ? , 
a ee eee eee : 
yy r—® gi?) gir cere Vn? 
yi yo yg eine ya 
Hint. Use the result of Prob. 25 and observe the resulting structure of the rows. 


28. If (with reference to the situation given in Prob. 27) there exists a set of 
relationships ‘(differential equations) of the form 


Boys + pre) + pay? +++ + dade = 0 


for 
r=1,2,-+9n" 


in which the coefficients po, 1, 2,*** Pn are constant or variable, show that 


4,2-% 
(a) di sr, do A 
(b) A = Age 1 /p0) & 


in which Ao is the integration constant. 
Hint. Give r the values 1, 2,--- and obtain, from each equation, the value of 
y,(", Substitute these values into the last row of the expression in Prob. 27. 
. 29, Given 
ax, = xe(t) fork = 1,2,-++m (asystem of m unknown functions of ¢) 


yn = yet) fork =1,2,--+” (a system of 2 known functions of #) 


and 
ay, fori,k = 1,2,--+m (a collection of n? constants) 
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These quantities are related by. the following system of first-order, first-degree, linear 
equations 


Oh, + Goyke +++* + dnrtn = ye fork = 1,2,---n 


Show that a solution of this system (the particular one) is given by 


Auf yidt + Am f yd +. + Ant f yo 


v% = [ain] 


in which the A ,’s are cofactors of the determinant |a,z|. 
30. Show that 


Lotoered 
1 2 QBe+. Qed 
3 3% +++ Bm 2] =11K2IX31+--(m—1)! 


ee 


Hint. Reduce the determinant to the diagonal form. Use Barlow’s tables of squares 
(pages 202 to 206) for the powers of integer numbers and observe the law of forma- 
tion. 


31. 
Un = Un(41°°+ Xn) fork =1,2,-+-n 
are » single-valued differentiable functions of the independent variables x1, x2, +++ ap». 


The “ Jacobian” of these functions is, by definition, the following functional 
determinant: 


Ox, Oxe Oxy 

dug Aug || dus 

= (He) - x, Ox, °  Omy 
HyXQ °° * Xn 

Dtty On | thn 

0x1 Ox Oxn 


Suppose the variables x1 + - - x, are changed to the new independent variables 2: +++ z, 
according to the equations of transformation 


= Xn (Z1°°* Sa) fork =1,2,-++n 
The original 4, functions are now 
Uy = Un(Z1°+*8,) fork = 1,2,+++n 


and their Jacobian with respect to the variables 21+ - + z,, for example, J;, differs from 
J only in that the variables x are replaced by corresponding z’s. 
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(a) Show that the above Jacobians are connected by the relation 


98) OAL), Oe 
021 022 O2n 
Ore: Ooe | Oe 
Ji =| 9%1 92g Oin |X J 


we eee seerseeee 


(b) Extend the above result so as to consider subsequent transformations of the 
form : 


2; = 3;(71,72,°** tn) forj =1 2,:++n 
Tp = p(t, t2,**+tn) forp =1,2,---” 
(c) What happens with the last Jacobian when any intermediate functional 
determinant is identically zero? Hint. Apply the rule for differentiation which reads: 
Ou, SP Out, Ox; 
Qe, 1 Ox; des r,s =1,2,-+-n 
32, Let 
Gy, = Ojn (41, %2,°°° Xn) forj,k = 1,2,--+" 
be a system of 7? differentiable functions of the independent variables x1, 72° * + Xn. 


Through the introduction of a new set of independent variables #1, Z2, - - - Z, by means 
of the functional relations 


xp = XE (Fi, Be,°** Zn) fork =1,2,°-- 


the system of functions @;, in the old variables goes over into the transformed system 
4; in the new variables. 
Accepting the result that a;, goes over into 4; in accordance with the law of 


transformation 
Gpg = Li, o# oe 3, (p,q = 1,2,+++) 
prove that the determinant |a;:| is transformed as indicated by 


[pel = lain] xX J? 
in which 
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33. The expression for the three-dimensional volume element in a general system 
’ of co-ordinates is given by 


aV=V |g d%, dd, dé3 
in which 
rer Ox, Ox, 
ya OF; OZ, 


Sik = y=1,2,-+:m 7 =1,2,+-+n k=1,2,---n 


If the co-ordinate system is orthogonal, the gj, system has the property 


_{[#90 forj =k 
Sik) ~ 0 forj =k 


Check the values of \/|¢;,| for the different co-ordinate systems and laws of co- 
ordinate transformation given in the following table: 


Name Equations of Transformation VJ lez 
(|x = 24 
Cartesian {lxe = Zo i 
C3 = Z3 
7 x1 = %1cosz 
Circular ee - as oe ig fs 
cylindrical }|*2 ~ 715) 72 
x3 = F3 
ae 1 = ¢ cosh %1 cos % 
Elliptic Ps = csinh zy sin Bp c?(cosh? % — cos? 3g) 
cylindrical Re 
x3 = Z3;c¢ = const. 
12.2 2 
; 1 = (21° — & 
Parabolic bee = che 2’) (812 + 352) 
cylindrical }|*? ~ 71°? 
x3 = X3 
asinh 71 
bate a 
; cosh %; — cos #2 F 
Bipolar ec a 
aes asin %2 Yoke ear 
cylindrical ||y_ = —————-—_ (cosh %; — cos Ze) 


cosh %1 — Cos Ze 
x3 = %3;a = const. 
%1 = c cosh 21 cos %2;c = const. 
Spheroidal ;|x2 = c sinh 2%) sin Ze cos 23 c®(cosh? #1 — cos? Ze) sinh 2; sin 22 
3 = c sinh 2 sin Ze sin 33 


21 = 2% COS Ze sin Z3 
Spherical {|v = %1 sin Ze sin 23 #1? sin %3 
23 = B1 Cos #3 


34, Given the multiple integral 


T= ff [Pea 0) der dns de 


in which x1, %2,-+- %, are the independent variables. If new variables %1, Ze, ++ <2, 
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are introduced in accordance with the relations 
xy = Xp(H1, B2,°°* Fn) fork =1,2,-°°m 


it can be shown that the above integral becomes . 


1m ff fre te te) dan dae: ++ dan 


in which J is the determinant given in Prob. 32. 

Compute the value of the determinant J for each set of transformation functions 
given in the second column of the table in Prob. 33. 

35, Given the following system of m + m linear equations involving the m +1 
unknowns x for = 1, 2,--+, and e, for p = 1,2,--- m, with n > m: 


B=n 


P=m™ 
2X, Ante — ex + XZ, coon =0 ford =1,2,---” 
= = 


u=n 
LX Aout, = 90 forp =1,2,---m 
p=l1 
(a) Write in determinant form, according to the above order of these equations, 
the condition for the existence of nontrivial solutions. 
(b) Show that this determinant is a polynomial in e of the degree n — m. 
(c) Using Laplace’s development with respect to the last m rows, show that the 
total number of mth-order minors which can be formed is given by 


(n + m)(n +m —1)(m +m —2)--: (n +1) (m <n) 
m! 
and that only 
n(n —1)(n —2)---(n —m +1) 
m\ 


of these are not necessarily zero (the rest being identically zero). 


CHAPTERII 


Matrices 


1. LINEAR TRANSFORMATIONS 
In Art. 8 of Ch. I, the system of linear equations 


41% + Ay2Q%—q + +++ + Ankn = V1 
GgiX% + AgeX%e + +++ + Gann = Yo [1] 


a 


OniX1 + AngXe + +++ + AnnXn = Yn 


is considered to relate a set of unknown quantities x; --- x, to a set of 
known quantities y1---%¥,. An alternate point of view, which plays an 
important part in the analysis of physical problems, is to look upon these 
equations as relating a set of given variables x; -+- x, to a new set of 
variables y; - - - Yn. The equations are said to ¢ransform the old variables 
into new ones. In this sense the set of Eqs. 1 is spoken of as a linear 
transformation. 

The transformation is characterized completely by the coefficients 
a. Since not only the values of the coefficients but also their relative 
positions in the equations are significant in this respect, a symbolic form 
for the characterization of the linear transformation is given by means of 
the rectangular array 


1 2 Ain 
Q@ =| C1 2 °° Fan [2] 
ant an2 ann 


which is called the matrix of the transformation. 

Offhand, it might be thought that the determinant could serve this 
purpose, but here it must be recalled that the determinant is a function 
of the coefficients and not merely a symbolic representation of them in 
their relative orientations. The determinant has a value; the matrix, 
on the other hand, is merely a picture and has no value other than that 
which it conveys by its structural composition. For the moment it has but 
one reason for existence, namely, that it is easier to write down than the 
system of Eqs. 1, yet contains essentially the same information. 

In outward appearance the determinant differs from the matrix only 
in that the latter is enclosed in square brackets whereas in the determi- 
nant the elements are enclosed between vertical lines. In this book, an 
upper-case script letter is used to denote the matrix and the upper-case 
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italic letter denotes the corresponding determinant. Thus the matrix 2 
is denoted by @ and the corresponding determinant by A. 

"Just as determinants facilitate the study of linear equations and re- 
lated problems, so the algebra of matrices is justified in that it facilitates 
the manipulation of several sets of linear transformations. The rules of 
matrix algebra are chosen with this end in view and are otherwise quite 
arbitrary except that they must, of course, be self-consistent. 

A matrix does not necessarily have the same numbers of rows as 
columns. Its form is therefore more flexible than that of a determinant, 
which must always be square. If the matrix has m rows and m columns, 
it is referred to as having the form m by 1, or it may simply be called 
an (mn)-matrix. When m equals n, the matrix is said to be of order 
n. In the extreme cases in which the matrix consists merely of a single 
row or a single column, it is referred to as a row matrix or a column 
matrix respectively. These special forms occur frequently enough to 
warrant the use of some abbreviated notation to distinguish them. 

The notation used in this text for a row matrix is 


SSE aged [3] 
A column matrix is indicated as shown by the equation 
Bi 


Bo 
plea a [4] 
Bn 
At times it becomes necessary to have the abbreviated symbol for a 


matrix indicate the number of rows and columns involved. A matrix 
having m rows and columns is written 


Cos eee [5] 


Gm1 *** Gmn 


A less specific notation, however, is sufficient when the number-of rows 
and columns either is clear from the discussion involved or is immaterial. 
If a rule for the multiplication of matrices is properly chosen, the linear 
transformation, Eq. 1, may be written as a matrix equation. One of the 
four alternative rules for forming the product of two determinants (dis- 
cussed in-.Art. 7, Ch. I) may be used for this purpose. It is conventional 
to choose the rule expressed by Eq. 34, Ch. I. Thus in order to form 
the product @ of two matrices @ and &, as indicated by the equation 


@xB=e [6] 
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the elements in €C are obtained by multiplying the rows of @ by the 
columns of &, as described in detail for the multiplication of determinants. 

With the adoption of this convention, the matrix equation replacing 
Eq. 1 reads 


x1 y1 
41 G2°*** An X2 Ye ; 
G31 G22 +*: Aan x ae ae (i (7] 
Qn1 ne ann 

Xn Yn. 


which may be abbreviated as 
@ Xx a] = yl [8] 


The element y,, which is the (1,1)-element in the column matrix y], is 
obtained by multiplying the elements in the first row of @ by the corre- 
sponding elements of the first column (the only column) of x] and adding 
the results. This gives 


Qy1%1 + QyoXe ++ +++ Qnkn = V1 {9] 


which is the first equation in the set 1. Similarly, ye which is the (2,1)- 
element in the matrix y], is formed from the second row of @ and the 
first column of x], thus 


021% + Ago%o + +++ + dentn = Yo [10] 


This is the second equation in the set 1. The remaining equations in this 
set are similarly contained in the matrix equation 7 or 8. 

It should be noted that the column matrix y] has no elements for the 
position (1, 2), (1, 3) --- or (2,2), (2,3)---, etc., because it has only 
one column — the first column. For the missing elements, or zero elements 
in y], the matrix product expressed by Eq. 7 automatically yields zeros 
according to the adopted product-forming rule because the column 
matrix x] has no second, third, - - - , etc., columns; that is, the elements 
in these missing columns of the matrix 2] are all zero. 

The present argument requires for completeness a statement of the 
rule that two matrices are equal only when all corresponding elements 
are equal. Specifically, if two matrices # and 2 with elements ,, and 
Qrs are to be equal, that is, if 


P= C11] 
it is necessary that 
Drs = Ire [12] 


for all values of the indexes r and s. For example, the left-hand side of 
Eq. 9 is the (1,1)-element of the product of @ and 2] in Eq. 7, and ,, 
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the right-hand side of Eq. 9, is the (1,1)-element of the matrix y] in 
Eq. 7. This rule for the equality of two matrices is, however, so obvious 
that it hardly needs emphasis. 

It should be clearly recognized that the objective of obtaining a matrix 
equation equivalent to the transformation 1 could have been met just 
as well by the adoption of any one of the other three methods of forming 
elements in the product matrix according to the rules of multiplying 
determinants and by suitable modification of the form of the matrix 
equation. The choice of the rule by which the rows of the first matrix 
in the product are multiplied by the columns of the second is arbitrary, 
but once it is adopted, consistency requires that it be looked upon as the 
rule. There are no alternatives as in the multiplication of determinants, 
because the alternative procedures, if applied to the example illustrated 
by Eq. 7, do not all yield the same final set of equations. 

That the adopted rule for matrix multiplication is not compatible with 
the commutative law in forming algebraic products is likewise apparent. 
Thus if @ and & are two matrices, the product @ x & is different (and 
has a different manipulative significance) from the product 8 x @. 
Thus in general 


Q@xB¥Bxa [13] 


in matrix multiplication. 

The utility of being able to write a transformation in the compact 
matrix form is appreciated when several transformations are to be carried 
out in tandem. In addition to the transformation 1 from the variables 
%1 +++, to the variables ¥, --- a, it may be necessary to transform 
subsequently to a set of variables 2; - - - 2, by means of the equations 


birt + Sieve + +++ + inde = 21 
boiy1 + boave +++ + banYn = 22 | [14] 


bay + bn2ve ip Na a Dann = Zn 


with the transformation matrix 


bir epee Bin 
Ba] ccceecees [15] 
bay es eS 
With the additional definition of the column matrix 
21 
Ze 
=!) [16] 


Zn 


ea tae ots om 
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this transformation takes the compact form 


BX y] = 2] [17] 
The direct transformation from the variables %1 --- %, to the final 
variables 2, --- %, is obtained by substitution of the expression for y] 

according to Eq. 8 into Eq. 17, which gives 
Bx xX x] = 3] [18] 

or, more compactly, 

€ x x] =3] [19] 
in which the resultant transformation matrix C is given by the product 
C=8x@ [20] 


The convenience of the matrix method of handling transformations 
becomes clear if it is contrasted with the additional labor involved when 
a specific example of the above transformation is carried through by 
means of the usual substitution process. The individual transformations 
may be assumed to be the following simple ones: 

Qy1% + Ay2%_ = a 
21% + Ag2%2 = Ye [21] 
and 
biiy1 + Dieve = 21 
boiv1 + beove = 2 [22] 
Substitution of Eqs. 21 into Eqs. 22 gives 
by (411% + 12%) + Bio (de1%1 + dee%e) = 21 [23] 
bei (Q11%1 + 12%) + boo (deity + Ao2%2) = 22 
Multiplying out and collecting terms with x, and x2, one obtains 
(yxd11 + by2001)%1 + (011412 + by2G29)%2 = 24 [24] 
(ba1d11 + boede1)a + (bordie + bo2de2)%2 = Be 


The coefficients in these equations are recognized as the elements of a 
matrix formed from the product 


bit al i el 
x 2 
is bos 421 d22 [el 
The ease of manipulation and circumspection resulting from the use 
of the corresponding matrix equations is proved even more conclusively 
in the applications discussed in Ch. IV. Meanwhile it is necessary to 


study the rules governing various additional algebraic manipulations 
with matrices. 
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2. ADDITION OF MATRICES 


By the sum of two matrices @ and & is meant a resultant matrix C 
whose elements are equal to the sums of corresponding elements of @ and 
Bs Explicitly, if 


GQ = cn hikes: Be es: |i ewan rt [26] 


then 


Qt B=C HI creer rere e cece ere reees [27] 
(Qt + Bini) mee (Gn + bmn) 


Similarly, the difference @ — & yields a matrix whose elements are 
the differences of corresponding elements of @ and &. 

Evidently the matrices @ and 8 should have the same number of rows 
and columns, or the missing rows and columns of one may be regarded 
as composed entirely of zeros. 


3. MULTIPLICATION BY A FACTOR 


If a matrix is multiplied by a factor, each element of the matrix 
becomes multiplied by this factor; thus 


kx nes hehsian e See gy aided see wacis [28] 


Omi *** Imn Rami * + Rann 


As is clear from the fact that if the factor & is zero, the matrix must be 
zero, which is true only when all elements of the matrix are zero. 

Tt should be observed that this rule is distinctly different from the 
corresponding one for determinants, in that the determinant becomes 
multiplied by a factor when the elements of only one row or column are 
multiplied by this factor. 


4, MULTIPLICATION OF MATRICES 


The operation of multiplication is introduced in Art. 1 above, but 
several additional remarks are necessary to complete the discussion. 
The individual matrices entering into the product @ X & may not have 
the same number of rows as columns, but it is clear from the manner in 
which this product is carried out that the number of columns in @ (the 
number of elements in a row of @) must equal the number of rows in B 
(the number of elements in a column of &); otherwise some of the 


36 MATRICES . (Ch. 1 


products of elements in the rows of @ with corresponding elements in the 
columns of & cannot be completed, for want of either one or the other of 
the corresponding elements. For example, when expansion of the follow- 
ing product, 


G1 M2 AN3 bis el 
x 29 
i Q22 | is bee [29] 
_ is attempted, it is found that there are no elements in the matrix & by 
which the elements a13 and @23 may be multiplied. These matrices are 
said to be nonconformable. They become conformable when a third row is 
added to the matrix 8. This row may, of course, consist entirely of zeros. 


Curiously enough, conformability does not require that the number of 
rows in @ be equal to the number of columns in &. Thus the formation 


of the product 
a a a bu 
[ 11 12 sa x bs; [30] 
G21 422 93 b 
31 
is considered straightforward, it being tacitly understood that the matrix 
® is interpreted as equivalent to 


by, 0 
bes 0 P31] 
bs, 0 . 

but mever that it is equivalent to . 
0 dy , 
0 dex _ [32] 
0 bai 


It is useful to observe that the matrix formed from the product @ x B 
has as many rows as @ and as many columns as 8%. For example, the 
product 30 yields a matrix with two rows and one column. In general, 
if the matrices are denoted more specifically as [@mn] and [bap], then 


[amn] X [ono] = [cmp] [33] 


These matrices are conformable because @ has 2 columns and & has a 
rows. 
Similarly the product of three conformable matrices reads 


Lamn] X [dnp] X [eq] = [dma] [34] 


Here the resultant matrix has as many rows as the first matrix and as 
many columns as the last one entering into the triple product. 

To carry out the product of three or more matrices such as @ X B X C, 
the component product @ x & may be evaluated first and the resultant 


Art. 4 MULTIPLICATION OF MATRICES 37 


matrix multiplied into C, or @ may be multiplied by the resultant of 
® X €. In symbols this procedure is stated by the equation 


a@xBxeC=(@x B)xC=@x (Bx C) [35] 


In other words, the associative law holds in matrix multiplication. 
This fact may prove useful in minimizing the amount of labor involved 
in carrying out a multiple product. Observe that the multiplication of the 
two matrices 

[amn] x [dnp] [36] 

involves the operations 
mxnxXp multiplications 37) 
m xX (n-1)X p additions 


Consequently a triple product in the association 
(L@mn] X [bnp]) X [¢pq] [38] 


involves the operations 
mXnxptmxpxy multiplications 


mX(n—-1)xXp+mx(p-—1)Xq additions [39] 
and in the association . | 
[ann] x ([bnp] x [¢pq]) [40] 
the operations are 
nXPXqQ+rmxnxXg| multiplications rat] 


nX(p—1)Xq+tmxX(n-—1)Xq additions 


As a numerical illustration let it be supposed that m = 1, n = 6, 
p= 1, and g = 4. Then the operations 39 are ten multiplications and 
five additions, whereas the operations 41 amount to 48 multiplications 
and 20 additions. 

Since, as pointed out earlier, the commutative law does not hold, the 
order in which the matrices enter into the multiple product must be 
carefully observed. In Eq. 35, for example, the component product 
(@ X &) is postmultiplied by C, and the component product (B x C) is 

* premultiplied by @. 
In this connection it is to be noted that the distributive law is valid 
also, with the caution that attention must again be given to the dis- 
tinction between premultiplication and postmultiplication. Two cases 
are illustrated in the equations 


(@+B)xC=@xC+B8xeC [42] 
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and 


Cx (@+8)=Cx@+exB [43] 


It is useful to observe that since the rule for matrix multiplication 
agrees with that for the multiplication of determinants, the determinant 
of the multiple product 


QAxBxex---xg [44] 
has the value 
, AXBXCX:::-XG [45] 


5. SOME SPECIAL FORMS OF SQUARE MATRICES 


A square matrix of the form 
dy, 0 O---0 


[46] 


in which all the elements are zero except those on the principal diagonal, 
is called a diagonal matrix. 

If, in addition, all the diagonal elements are unity, the matrix is re- 
ferred to as the identity or unit matrix. It is designated in this book by 
the special letter U and has the appearance 


U= [47] 
0 0 0 1 
If the matrix of the transformation 1 is U, the equations read 
v1 = 91 
ea [48] 
Xn = Vn 
This set is referred to as the identity transformation, since the old 
variables x; +--+, and the new variables ¥; - - - y, are identical. Any 
matrix either pre- or postmultiplied by %U is equal to itself. 
The unit matrix multiplied by a factor k becomes 
k00---0 
Ok 0O---0 ; 
K=!/0 0 &---0]-: [49] 
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This is called a scalar matrix because any matrix either pre- or post- 
multiplied by K is simply multiplied by the scalar factor k. 

When a matrix is multiplied by the diagonal matrix D, Eq. 46, the 
results for pre- and postmultiplication differ in the following manner. 
The formation of D x @ has the effect of multiplying the rows of @ by 
d11, doo, ++ - dnn respectively, whereas in the formation of @ * D the 
columns of @ are so multiplied. 

A square matrix @ whose elements satisfy the conditions 


OKs = Vix [50] 


evidently possesses symmetry about its principal diagonal. It is, there- 
fore, called a symmetrical matrix. The elements on its principal diagonal 
are, of course, arbitrary. 

On the other hand, if the elements are subjected to the conditions 

ai = —Az [51] 

those on the principal diagonal must be zero. In this case the matrix is 
said to be skew symmetric. 

Interchanging rows and columns has no effect upon a symmetrical 
matrix, but in the case of a skew symmetrical matrix it amounts to mul- 
tiplying by the factor —1. It is, therefore, seen that if both @ and B 


are symmetrical or both are skew symmetrical, @ xX B = & X QG; that 
is, the matrix product is commutative in these special cases. 


6. INVERSE, ADJOINT, TRANSPOSED, RECIPROCAL, AND ORTHOGONAL 
MATRICES 


The rank of a square matrix of order is equal to that of its determi- 
nant. When the rank is ”, the determinant is not zero, and the matrix 
is said to be nonsingular. The matrix is singular if its rank is less than 
. , that is, if its determinant is zero. 

When the matrix @ of the transformation 1 is nonsingular the trans- 
formation may, according to the discussion of Arts. 8 and 9 of the previous 
chapter, be inverted to yield 


birt + Dieve Hees + inn = % 
b2191 + bo2¥2 + +++ + denn = XQ 


bn V1 + breve + eee + Onan =Xy 
The matrix of this inverse transformation 


- by bya + +s | 


Poisae eos [53] 
bat bre rae a 


[52] 
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is referred to as the inverse of @. This is written 
@Qi=8 [54] 


The matrix @ evidently does not possess an inverse when it is singular. 
Tt is clear that all matrices are singular in which the number of rows is 
not equal to the number of columns, because the system of equations 
represented by such matrices does not have unique solutions. A square 
matrix does not possess an inverse if its rank is less than its order. 

A few additional remarks are in order regarding the inversion of a 
linear transformation as viewed from the corresponding matrix equations. 
Writing the matrix equation for the transformation 1 


Qx] = 9] [55] 


one is tempted to solve this for the matrix x] by simply dividing the 
equation through by @. This form of solution is, however, meaningless 
without an interpretation of the operation of division by a matrix. 

Such an interpretation is arrived at if one observes first that the ele- 
ments of the inverse matrix %, Eq. 53, are given by Eq. 48, Ch. I, which 
reads 


ies [56] 


in which A,, are the cofactors of the determinant A corresponding to 
the matrix @. The result expressed by Eq. 55, Ch. I, coupled with a 
recognition of the similarity between the processes of determinant multi- 
plication and matrix multiplication then shows that 


@x@=@?x@=% [57] 


that is, the product of a nonsingular matrix with its inverse equals the 
unit matrix, Eq. 47. Hence it becomes clear that the matrix equation 
55 is solved by premultiplying on both sides by @~", which gives 


Ux] = «] = Gy] [58] 


The operation of division by a matrix is, therefore, interpreted in terms 
of multiplication by means of the inverse matrix. 

Division by a matrix is subject to several restrictions. First of all, the 
matrix by which a matrix equation is to be divided must be nonsingular. 
Furthermore, in a matrix equation of the form 


@xBxe=§ [59] 
it is not possible in a single step to divide out & so as to get a solution 


for @ x €. . 
To obtain this result, a more lengthy procedure such as the following 
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must be used. The equation may first be postmultiplied by C~’, giving 


@x8=8sxet [60] 
Next it is postmultiplied by @~. It then reads 
| @=8xC!x . [61] 
Finally it is postmultiplied by C, giving the desired result 
@x@=8xetxB'?xe [62] 


The same result may alternatively be obtained by first premultiplying 
Eq. 59 by @™ so that it becomes 


Bxe€=@Q'xg [63] 
then premultiplying by B to obtain 
@=8°x@"xg [64] 
and finally premultiplying by @ so as to have 
@xeC=@xB'x@'xs [65] 


It should be observed that obtaining the desired result in this case 
requires not only that @ be nonsingular, but also that either @ or C be 
nonsingular. If this additional condition is not met, the desired result 
cannot be obtained. ' 

Because of these restrictions on the process of division by a matrix, 
it is best not to use the term “ division ’”’ at all. In matrix algebra pre- 
or postmultiplication by an inverse matrix is possible (if the given matrix 
is nonsingular), but the operation of division is said in general not to 
exist. 

By means of. Eq. 56 the inverse of @ may be written in the form 

Ar Aa ||, Ant 


A A A 
Aig Aso | Ans 
@i=|A A A [66] 


ee ee 


A A A 


The process of finding this inverse may be described as follows. First, 
each element in @ is replaced by the quotient of its corresponding co- 
factor and the determinant of @. Second, the rows and columns in this 
resulting matrix are interchanged. This procedure evidently involves 
a large amount of labor when the order of the given matrix is high. 
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There are other methods by which the inverse of a matrix may be found. 
One of these, which is usually considerably shorter than that stated here, 
is described in the next article. Additional methods are discussed in 
Art. 11. 

When the given matrix is of the diagonal form, Eq. 46, its inverse is 
also of the diagonal form and is given by 


[67] 


This result follows from the fact that all cofactors except those for the 
diagonal elements in D are zero, and these cofactors are 


Dye = dy1do2 +++ Oya p—1dkezsega* + * Ann [68] 
while the determinant of D is 
D= d1do9 wate. dnn , [69] 
Hence 
Dik 4 
D 7 ue [70] 


When A™ is factored out of the inverse matrix @-!, Eq. 66, there is . 
left 


[71] 


This matrix is called the adjoint of @. Since the adjoint contains only 
the cofactors of the elements of @, it may exist even when @ is singular, 
but only if the rank of @ is not less than (x — 1) because all the cofactors 
would otherwise be zero. 

According to Eq. 55, Ch. I, the determinant of @~ is A7!; that is, 
the inverse matrix has a determinant which is the inverse of that of the 
given matrix. It is clear from Eq. 71 that the adjoint matrix @? of order 
n has a determinant which is A” times that of @—!. Hence the determinant 
of the adjoint is A”—?; that is, 


Re uias eaear = Ar" [72] 


Ain Aan ee Ann 


It follows that if the determinant of @ is not zero, the determinant of the 
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adjoint is not zero either, whereas if the determinant of @ is zero, that of 
@* is zero also. 

Hence if @ has the rank m, then @? also has the rank n; but if @ has 
any rank less than (7 — 1), the rank of @? is zero because all its elements 
are zero. When the rank of @ is (w — 1), at least one of the elements of 
@* is not zero and, therefore, its rank is at least 1. It can be shown to be 
no greater than 1, as-is further discussed in Art. 8 of Ch. TIT. 

The linear transformation 1 is for some purposes more effectively 
studied when written in the schematic form 


x1 X2 X%3 + +° hn 
Yi} M1 Gig 13 *°* Gin 
Yo | der 22 G23 +++ Gon 
4y3 | @31 G32 433 °° * Gan [73] 
Yn | Gn1 Ong Ong *** Inn 


which places the matrix more clearly in evidence. The Eqs. 1 are easily 
read out of this schematic arrangement by following along the rows and 
mentally dropping the x’s down into their proper positions, supplying 
at the same time the necessary equal and plus signs. It now becomes 
more evident that these equations are closely related to another set, 
namely that which is obtained when the same manner of reading is 
carried out along the columns. Written out in the normal fashion, these 
equations are 


O11 + Gei¥o + ++ + Oni¥n = M1 
Q12¥1 + doeVo + +++ + On2In = Xe [74] 


eee etree ere reer erererensee reser eene 


A1nV1 + Gonye2 tee t Qnnyn = Xn 


With reference to Eqs. 1, they are called the transposed set. Their 
matrix is 


@, = 32 G22 *** Ine [75] 


Qn Gan *** ann 


and is called the transpose (sometimes also the conjugate) of @. The 
transpose of @ is the matrix @ with its rows and columns interchanged. 
In this book the transposed matrix is designated by the subscript ¢ 
and the adjoint by the superscript a, as in Eqs. 75 and 71. 
The transpose of a matrix evidently exists whether that matrix is 
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singular or not and also when the number of rows is different from the 
number of columns. In particular, the transpose of a row matrix is a 
column matrix, and vice versa; that is, 


ea= a] [76] 
and 
a], = |, [77] 


The transposed set of equations 74 should not be confused with the 
inverse set 52. For certain types of matrices the transpose and inverse 
are the same, but not in general. 

In order for the inverse of a matrix to be equal to its transpose, the 
elements of that matrix must satisfy certain special conditions. It is easy 
to establish these conditions. If the elements of the matrix @ are a,., 
those of the transpose @, are a,,, and the elements of the inverse matrix 
@ are given by Eq. 56. Hence the desired conditions are expressed by 


er = ae [78] 


The significance of this result may be made more evident by substitut- 
ing it into the relations expressed by Eqs. 45 and 46 of Ch. I. These 


equations, it is recalled, hold generally for the elements and cofactors 
of any determinant. Substituting Eq. 78 into Eq. 45 of Ch. I gives 


1 fort =k 
Q1401% + Ggidan +++ + + Onidnk = ‘o Pees mp [79] 
Substituting Eq. 78 into Eq. 46 of Ch. I yields the companion relation 
fi fori=k 
GiyAer + Bigdeg +++ + indin = | 0 Pa Xk [80] 


Equation 79 states that the sum of the squares of the elements of any 
column equals unity, but that the sum of the products of the elements 
of any column with the corresponding ones of any other column is equal 
to zero. Equation 80 expresses a similar relation with regard to the ele- 
ments in the rows of the matrix. For example, 


Gy,” + O07 +-++ + Ayn? = 1 
or 
Gay? + gg? + +++ + ang? = 1 [81] 
but ; 

011021 + 12099 + +++ + Aindan = 0 
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or 
Gy2013 + de2de3 + +++ + Gnodn3 = 0 [82] 


and so forth. 

A matrix whose elements satisfy the relation 78, and hence the relations 
79 and 80 also, is called an orthogonal matrix. The term “ orthogonal ” 
suggests the existence of a right-angle relationship. The geometrical 
interpretation which justifies the name for this particular type of matrix 
is given in the following chapter. A simple numerical illustration of an 


orthogonal matrix is 
05 O35 0.707 
@ =| —0.707 0.707 0 [83] 


-—0.5 -—0.5 0.707 


The determinant of the matrix @ is A. The determinant of its transpose 
is also A, but since the transpose of an orthogonal matrix is equal to its 
inverse, the determinant of @ must at the same time be equal to A™. 
Hence the determinant of an orthogonal matrix must satisfy the equation 


1 
A= mt . [84] 
whence 
A?=1 [85] 
and - 
A= +1 : [86] 


The conditions under which the algebraic sign is either plus or minus 
are discussed subsequently. For the matrix 83, the determinant is +1, 
as the reader may readily verify. 

The transpose of the inverse of a matrix plays a sufficiently important 
role in the subjects utilizing matrix algebra to justify for it a special 
name and designation. It is called the reciprocal matrix, and it is desig- 
nated by an asterisk affixed to the corresponding script letter. Thus the 
reciprocal of @ is given by 


 @*=@;7 [87] 


The élements of the reciprocal matrix are similarly designated by an 
asterisk placed on the elements a,, of @. Reference to Eq. 66 shows that . 
the elements of @* are given by 


a*,, = 7 [88] 
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The same result is obtained by calculation of the inverse of the trans- 
pose, so transposition and inversion are commutative. 

The following summary may be helpful at this point: 

For a given matrix @ with elements a,,, determinant A, and co- 
factors A;s: 


. ‘ : A : 
The inverse of @ is @—! with elements vi and determinant A7. 


The adjoint of @ is @* with elements A,, and determinant A”~?. 
The transpose of @ is @, with elements a,, and determinant A. 


: ae Are , 
The reciprocal of @ is @* with elements a*,, = og and determinant 
Att 
If @ is orthogonal, then @; = @7!,a,, = ae »and determinant A = +1. 
Also @* = @; that is, an orthogonal matrix is its own reciprocal. 


In the manipulation of matrix equations it is sometimes useful to note 
that 


(@ x B): = x @: [89] 


that is, the transpose of a product is equal to the reversed product of the 
individually transposed matrices. This conclusion follows from the fact 
that the product (@ x &); is formed by multiplying the rows of @ by 
the columns of & and then interchanging the rows and columns in the 
resulting matrix, which is the same as though the elements had been 
formed by multiplying the columns of & (these are the rows of B;) by 
the rows of @ (these are the columns of @,). For example, the (2,1)- 
element in the resultant matrix (@ x &), is formed from the first row 
in @ and the second column in &; and the (2,1)-element in the resultant 
matrix &, X @; is formed from the second row in &; (second column in 
&) and the first column in Q, (first row in @), which are identical com- 
binations. 

This relationship may readily be extended to multiple products. If 


a@xB=G [90] 
then 
(@Qx 8),=G:=B,.xQ; [91] 
Next if 
@xBxe=8xe —([92] 
then 
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In general, 
(@xXBxCX---XP)e=PiX-+- XOX BX A, [94] 
The transpose of a multiple product is equal to the reversed product of 


the individually transposed matrices. 
A similar relationship also holds for the inverse of a matrix product. If 


ax B= [95] 
then from the fact that 
e€xect= U (unit matrix) [96] 
and 
@xSx@> xa =o [97] 
it is seen. that 
@-! = Q-1q@-1 : [98] 
Hence 
. Gx a)" =a" xq [99] 


The inverse of a product is equal to the reversed product of the individual 
inverse matrices. 
The extension to multiple products follows as before, 


@xXBx€x---xPyt=P41x---xe*x BI xa! [100] 


It is clear that all the matrices must be nonsingular in order for this 
relation to apply, but this fact does not mean that a multiple product 
involving singular matrices cannot in some cases have an inverse. For 
example, the component matrices in the following product are evidently 
singular because they are not square: 


[424] X [B42] = [coe] [101] 
But the resultant matrix is square and may be nonsingular, in which- 
case it possesses an inverse. The relation 99 is not applicable to the - 
product in Eq. 101. 
Similar reasoning shows that an analogous relationship holds also for 
the adjoint of a product of two matrices 
(AX De = B*.x ae [102] 
as well as for the adjoint of a multiple product 
(AX BXKEXK+++ x P)* = P* K++ KC* XX BX Ae [103] 


According to the definition (Eq. 87) of the reciprocal of a matrix, it 
follows from Eqs. 89 and 99 that 


(@ xX B)* = G* x BF [104] 


48 MATRICES [Ch. IT 


The reciprocal of the product of two matrices is equal to the product 
(not reversed) of the individual reciprocals. The extension of this rule 
to multiple products evidently reads 


(@xXxBxex---x PJ* =QA* x BEX C*X--- x P* [105] 


The relations expressed by Eqs. 89 and 99 also show that if P and 2 are 
orthogonal matrices, so that 


9,=9" and 2,=27 [106] 


then 
(Px). = (FP XQ) [107] 


Hence the product of two orthogonal matrices is again orthogonal. 


7. SUBMATRICES OR THE PARTITIONING OF MATRICES 


In carrying out a matrix product, subdividing or partitioning the 
matrices into smaller components is sometimes convenient. In the 
following example 


<—r—} 
G11 2 1 Q43 1 
21 Ag2 | 23 Ho = bal 

@ xX B=| ---------- Milanese ey [108] 
a31 32 33 
G41 a2 i 43 bar 1 Ose bs] 


the so-called submatrices in @ are 


| 41 ez _ | %13 
a1 = a2 = 
421 22 a23 


[109] 


and in & they are 


B iia ied Bg ze: ie A 
bar m bee beg [110] 
Bor = [bs] Boo = [bs2 sa] 
The matrix product, Eq. 108, may be written 
@xQ= [es ps ie ral 
a 21 G29 a Boi Bae ttt] 


and evaluated as though the submatrices were ordinary elements except 
that the order in which they enter into the products formed by this 
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evaluation process must be observed carefully. Such products of sub- 
matrices must, of course, subsequently be carried out by the same rule 
of matrix multiplication. 

When the original matrices are partitioned as is done in Eq. 108, 
the division of the columns of @ into subgroups must be identical with 
the division of the rows of & into subgroups. Thus the subgroup of the 
first r columns of @ must be matched by a subgroup of the first r rows of 
@. This matching assures that all the submatrices appearing in the 
products formed during the subsequent evaluation of Eq. 111 will be 
conformable. 

If the total number of columns in @ are subdivided into more than 
two subgroups, the rows of & must be similarly subdivided. The rows of 
@ and the columns of 8, on the other hand, may be subdivided in any 
desired manner without affecting the conformability of submatrices later 
appearing in the evaluation process. In Eq. 108 the columns of the matrix 
® may, for example, not be subdivided at all or they may be divided 
into three subgroups instead of two. 

It is an instructive exercise to show, by means of the relations 37, 
that the total operations of multiplication and of addition involved in 
_ the complete evaluation of a matrix product are independent of whether 
or how the component matrices are subdivided. This fact alone might be 
regarded as an indication that the present discussion is, for practical 
purposes, wholly irrelevant except that subdivision may be convenient 
in certain special cases. A somewhat different application of the principle 
-of subdividing matrices is, however, very useful practically in the solution 
of a set of linear equations or in the inversion of a matrix. 

The subdivided matrix equation for a linear transformation is indicated 


by 


v1 
1 
G1 A211 413 * Gin x2 
1 
doy 22 1 23 * Gon cas 
wen ew eww eww ew ee ew ewe =e xX: 
P< hie [112] 
a31 Aq , G3 a3n : 
1 
dae anens Von eee cence 
Gn, Ang : Gn3 Gan 
Xn, 
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and 


[114] 
| ¥3 
— {1 = ; 
1 | n2 
Yn 
one may write the transformation, Eq. 112, as 
a11& + aygts = m ; 
11 
ag11 + agebo = neo [ 5] 


A possible method of solution is the following. The second equation in 
the set 115 may be premultiplied by ag27! and solved for é2, giving 


$= aga m2 — a2 oaks [116] 
Substituting this into the first equation of set 115 gives 
(a1 — 2099 *021)#1 = 1y — 12029 "ne [117] 
from which 
£1 = (a42 — aigc227*e21)*(m. — e229 "n2) [118] 
In a similar fashion one obtains the solution for £5, 
& = (agg — agiayy ‘oy2) "(mg — aeiar1 'm) [119] 


As is to be expected, this is simply Eq. 118 with the subscripts 1 and 2 
interchanged. 
Writing for brevity 
01 = (ai, — e122" *e21) 
[120] 


02 = (22 — ag1011 a12) 
the inverse of the matrix @ in the transformation 112 is indicated by 
ee | 01 ‘e209 
@-} = | ------------------------ [121] 
— 6g *a21e4177 | 02+ 


The inversion of a matrix of given order is thus reduced to the inversion 
of matrices of lower order. These are the submatrices a1; and age, and the 
resultant matrices 6; and @ given by Eqs. 120. For a1; and agg to be 
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nonsingular, they must have the same number of rows as columns. 
The matrix @ must, therefore, be subdivided to meet this condition, as 
is done in Eq. 112. Even then it may happen that a1; or ae is singular, 
but this situation can always be remedied by a rearrangement in the order 
of the original equations. 

The submatrices aj, and ag; do not need to be inverted. They may 
have any number of rows irrespective of the number of columns. The 
matrices 6, and 6) are evidently square and of the same order as ay 
and age respectively. The process is, therefore, always applicable pro- 
vided, of course, that @ is nonsingular. 

When the matrix @ has many rows and columns, an extension of the 
method may be developed which allows for further subdivision, or the 
present method may again be applied to some of the submatrices obtained 
from a preliminary subdivision. 

A numerical example may serve to illustrate the advantage of this 
method over the one suggested by Cramer’s rule. Let the matrix be 


Oo atm 
ieee ay oa 

@ =| ---------------- [122] 
ae ee 


For the indicated subdivision, 


2 33 -4 1 
a, 1 -2 a lB 
3-2 eee, 
oar={-2 1f “FL 3 -1 


In this simple case the submatrices 11 and ae2 may be inverted by in- 
spection, giving 


{fF 2. <3 if1 3 
ai = 5/1 = ax =|; x [124] 


Next in order is the determination of 


ie xh Ee 
ria | Oe? ae 3 oO) > Fk a4 


[123] 


and 


-1_! 
21011 aa 7 
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Then 
wonton fA LS otf E] en 
mace AL! SJ (-t ]=1[-2 2] oe 
whence . 
m= Et col- alia aa) a1oy se] 0 
of STs 2] we 
and . 
m= [3 GL Zs -as}-abs te] 090 
w=t[ 5 26] ~siL-25 a] 132 


Finally, 


1756 33]. 17-1 -10] 1758 —14 
—A,~! eae) Weeeeinecte ; =) ee eee 
Or” races al; 3x3 14 vA =| 7 a [133] 


and . 
17 28 7]. 17-8 -5] 17-29 —12 
_— —-1 ed es sect eo 
$a aie sil 28 ax Al 3 | al 32 a [134] 


When Eggs. 130, 132, 133, and 134 are put together according to the form 
indicated in Eq. 121, the desired inverse is found to be 


—56 -33 58 —14 
1) 7 -3 7 16 
~57/-29 -12 28 7 

32 27-258 


@ [135] 


The student should obtain this same result by means of Cramer’s rule 
directly in order to appreciate how considerable a saving in labor is- 
afforded by the use of submatrices. 

The matrix of Eqs. 112 may be manipulated in a variety of additional 
ways which yield slight modifications in the process of obtaining the same 
end result, but there is little point in further discussing this item here 
since the fundamental principle remains the same. 
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When, after subdivision, the given matrix has the form 


_ Qi 0 
a -: E ae 


Eqs. 120 and 121 show that the inverse is given by 


—1 0 
@- = | “11 es | 
E a2 1 


_ 0 a12 
a=[%., 0 ] 


the inversion of Eq. 115 shows that 


1_|9 21 
a = Pe 0 


On the other hand, if 


53 


[136] 


[137] 


[138] 


| [139] 


These simple examples are readily generalized, with the result that if 


a1 0 ac 0 
ene 
0 0 Onn 
then 
11 - 0 0 
@-1 = 0 a9 + 0 
0 0 Onan 
and if 
0 ¢ 0 Qin 
Q= 0 * A2,n—1 0 
Qn) * 0 0 
then the inverse is given by 
0 ...0 Ga 
q@-1 = 0 ss @ tne 0 
ayn) - 0 0 
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[140] 


[141] 


[142] 


[143] 


In Art. 1 it is shown that if the variables ; - - - y, in the transformation 
1 are subjected to the further transformation 14, the matrix which com- 
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bines the two transformations into one is given by the matrix product 
expressed by Eq. 20. In this process the matrix @ is said to be linearly 
transformed into the matrix C. 

The matrix &, by means of which the transformation of @ is effected, 
may sometimes advantageously be thought of as the resultant of a 
multiple product of very simple component matrices; thus 


B= BrX-++- X Bs X Bo X By [144] 


The transformation of @ is then considered to be accomplished through 
the succession of a number of simpler transformations, the first of which 
is effected by 8, the second by 8s, and so forth. 

A matrix may be transformed by postmultiplication as well as by 
premultiplication, in which case the matrix effecting the transformation is 
decomposed into components according to the order indicated by 


B= Bi X Ba X Ba X--: KX Ba [145] 


Here &, accomplishes the first simple transformation, B, the second, 
and so forth. 

’ Tt is of interest to determine the simplest fundamental forms into 
which an arbitrary nonsingular matrix & may be decomposed, and 
to interpret the individual transformations which they effect upon the 
form of a matrix @. They are called elementary transformations, and the 
matrices which produce them are the elementary transformation matrices. 
There are three types of elementary transformations, and, correspond- 
ingly, there are three fundamental types of so-called elementary trans- 
formation matrices. 

The first type of elementary transformation amounts to an interchange 
of any two rows or columns of the matrix @. The second type is the addi- 
tion of the elements of a row or column to the corresponding elements of 
another row or column; and the third type is the multiplication of any 
row or column of @ by an arbitrary nonzero factor. Each type of trans- 
formation may be effected through multiplying @ by a corresponding 
typeof transformation matrix. If the desired transformation is intended 
to affect the rows, @ is premultiplied by the transformation matrix; if 
the columns are to be affected, @ is postmultiplied by the transformation 
matrix. 

Each type of elementary transformation matrix is formed from the 
unit matrix U, Eq. 47, by performing upon it the same elementary 
transformation which the desired transformation matrix is intended to 
effect in the matrix @ by means of pre- or postmultiplication. Thus UW 
with its pth and gth rows (respectively columns) interchanged yields a 
transformation matrix (type 1) which, by means of pre- (respectively 
post-) multiplication, interchanges the pth and qth rows (respectively 
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columns) of @. The matrix U with its gth row (respectively column) 
added to its pth row (respectively column) yields the transformation 
matrix (type 2) which, by means of pre- (respectively post-) multiplica- 
tion, effects the addition of the elements of the gth row (respectively 
column) of @ to the corresponding elements of its pth row (respectively 
column). Finally, the matrix U with its pth row (respectively column) 
multiplied by a factor k yields a transformation matrix (type 3). which, 
by means of pre- (respectively post-) multiplication, multiplies the 
elements of the pth row (respectively column) of @ by &. 

Observe that each type of elementary transformation matrix has twa 
forms according to whether it is intended to effect a transformation of the 
rows or columns of @ (by pre- or postmultiplication, respectively). The 
one form in each case is evidently the transpose of the other, and in types 
1 and 3 these two are readily seen to be identical, whereas in type 2 the 
transposed matrix merely interchanges the distinction between the 
designations p and q in the description of the preceding paragraph. 

The elementary transformation matrices are in this text denoted by 
the script letter J with subscripts inffnded to indicate the type. Thus 
TJ p~q is used to designate type 1; Jp4, designates type 2 for the trans- 
formation of rows, whereas Jp49) is the designation of this type for the 
transformation of columns; and J,,,, denotes the transformation matrix 
» of type 3. 

These are illustrated for the case of fourth-order matrices by the 
following examples: 


00 1 0 
010 0 
Sing = 100 01> (Fins)e (type 1) [146] 
0001 
100 0 
1 1 0 
To13 = 4 i ol (Fo4s1)¢ (type 2 for rows) [147] 
000 14° 
1.00 0 
Tors, = : i : : = (Seis): (type 2 forcolumns) [148] 
00 0 1 
100 0 
0 100 
S3xk = 008 Om (F3xn)e (type 3) [149] 
0001 
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The transformation of a fourth-order matrix @ by means of these is 


illustrated in the numerical examples. given below: 


00 1 O} } 2 


— 
ote 


aS | 
x 
— 
OPP NM OF OO 


1000 
0110 
Ta1sXG=!q 9 1 9 
0001 
243 6)/f1 
15 8 7\l0 
@xTasi=| 4 2 6 9/Xl0 
i0 5 3 1] lo 
1000] f2 
010 0\.| 1 
S35) XA = 0050 x 4 
000 1} {10 
243 6] f1 
15 8 7\l|0 
@xFex@=! 4 2 6 9/*l0 
10 5 3 1] [o 


e 
CSCOFOSO MANU OF RPO UnuUh COOrFRO UNnume 


SCUDD WODOW COHOD WAWMH DOOR waww 


RPOoOoO FOND RP OOO KF ONA KP OCO KONO 


4 
1 


S dn koe Se 


— 


—y 


em dO 
PrRdHS CORN OPEN 


—_ 
i=) 


ads oe 
Mba > a em U1 bo 


— 


nNoMNP COBWN 


5 
2 
5 


_ 


COPF ND WW OOD 


— 


WADW WAR W 


RB ONAN SB AATO 


= 
Pon 


moO 


RB OWAD 


[150] 


[151] 


[152] 


[153] 


[154] 


[155] 


The transformation matrix of type 2 may be generalized from the 
form Jp4, to Jp. (with reference to either rows or columns), which 
allows the gth row or column to be subtracted from as well as added to the 
pth row or column. It is useful to note that for this type 


Tosa = TF otp! 


and since 


Fosa)t = Spsal 


it follows that 


(Frade = T asp 


and 


(Fait = SF oti 


[156] 
[157] 
[158] 


[159] 
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For the matrices of types 1 and 3 it is clear that 
(Tpng)t T png [160] 
and 
Toxn)t = Sone [161] . 


It is also useful to note the form of the inverse of each type of trans- 
formation matrix, thus 


Tong) = S pra [162] 

Snag) = Spzq . (row or column) [163] 
and _ 4 

Toxk = Tpxa [164] 


According to Eqs. 160 and 162, the transpose of the matrix of type 1 
equals its inverse. This type of transformation matrix is therefore 
orthogonal. 

In the transformation of matrices it is useful to have a type of trans- 
formation matrix which effects the addition (or subtraction) of the 
k-multiplied gth row or column to (or from) the pth row or column in a 
single operation. Such a matrix is evidently the following combination 
of types 2 and 3: 


Tse = Sax? X Spa X Saxe [165] 
or 

Tpkal = Saxe X Sptai X SF axk [166] 
according to whether the rows are to be affected by premultiplication or 
the columns by postmultiplication, respectively. The matrix Tpizq or 
TS p4ka\ is formed from the unit matrix U by adding (or subtracting) its 


k-multiplied gth row or column to (or from) the pth row or column, 
respectively. It may be referred to as type 23. It has the properties 


Tpsra)t = Srskal [167] 
Tpikqg = JS pziq -—s (row or column) [168] 


A numerical illustration of the use of this matrix is 


100017243 6/72 4 3 6 
014 0|.11 58 7| |17 13 32 43 
ToraXA=19 9 1 oI 426 9)71 4 2 6 9| LO 
0001] |105 31] |t0 5 3 1 
243 6] 71000] [2 636 
158711010 0] |] 13787 
QxTs+m3i=| 4 2 6 9|/*%10 4 1 Ol | 4 26 6 9 [170} 
10 5 3 1| |0 0 0 1) {10 17 3 1 
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According to the theory of determinants, it is clear that the trans- 
formation matrices have determinants with the following values: 


(type1) Tprg= —1 [171] 
(type2) Tpig= 1 (row or column) [172] 
(type3) Tna = & [173] 
(type 23) Tring = 1 (row or column) [174] 


Hence transformation of the matrix @ by means of types 2 or 23 leaves 
the value of the determinant A unchanged; transformation by means of 
type 1 merely changes the algebraic sign of A; and transformation by 
means of type 3 has the effect of multiplying the determinant A by the 
factor k. 

It should be observed that an elementary transformation matrix J 
is always square (has the same number of rows as columns), but @ need 
not be. However, the order of J must be so chosen that the matrices are 
conformable. If @ is not square, the correct order of JS is different in the 
cases of pre- and postmultiplication. 

The three types of elementary transformation with the matrices 
Ty~q Fpiq, and Tp x, if repeatedly applied to an arbitrary matrix @, 
are capable of yielding the same end result as is expressed by the relation 


PxAxXQ=e [175] 


in which # and 2 are any nonsingular matrices. This is the same as saying 
that any nonsingular matrix such as P or 2 can be represented as a 
product of suitably chosen elementary matrices of types 1, 2, and 3 
- alone. 


9, EQUIVALENCE OF MATRICES 


A matrix € which may be obtained from a matrix @ by means of a 
finite number of elementary transformations is said to be equivalent to @. 
The equivalence of matrices is evidently a mutual relationship, since the 
elementary transformations are nonsingular and hence reversible. Thus 
if C can be obtained from @ by a succession of elementary transformations, 
it follows that @ can be regained by means of elementary transformations. 
The state of equivalence of two matrices @ and C may evidently be ex- 
pressed by Eq. 175, in which and 2 are arbitrary nonsingular matrices. 

It is important for later applications to recognize that equivalent 
matrices have the same rank. The truth of this statement may be seen to 
follow from the fact that the rank of a matrix depends upon properties 
of the corresponding determinant (the vanishing or nonvanishing of this 


e 
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determinant and its minors), which, according to the theory of determi- 
nants, are not altered by the elementary transformations. 


10. TRANSFORMATION OF A SQUARE MATRIX TO THE DIAGONAL FORM 

A square matrix possesses equivalent diagonal forms, any of which 
may be obtained through a linear transformation having the form of 
Eq. 175. Such a transformation may be written 


PxXAxQ2Q=9 [176] 


® being an equivalent diagonal form of @. 

The matrices # and 2 which accomplish the transformation can be 
formed from components of the combination type 23 alone (see Art. 8), 
in which case @ and D have the same determinant. 

The detailed process is best shown by means of a numerical example. 
The one given in Art. 2, Ch. I, illustrating the numerical evaluation of a 
determinant, involves precisely the elementary transformations required 
here. The given matrix @ may be assumed to have the determinant A 
expressed by Eq. 4, Ch. I. The matrix # consists of components of the 
type Jpinq, their formation being described in Art. 2, Ch. I, by steps 
1, 2, and 3. Since premultiplication is required by these transformations, 
the vorder of the components referring Eespeevey to these steps reads 
from right to left, thus 


1 00 {00 100 1 00 
g=/0 1 O|x} 01 O|x}-4 1 O]=|-4 1 Of [177] 
OSS ee Tes Or A 001 $044 


Third step Second step _ First step 


Similarly, the matrix 2 is given by the product of components of the 
type Jpskq\, their formation being described by steps 4, 5, and 6. Here 
postmultiplication is involved. Hence the order of the component matrices 
corresponding respectively to these steps reads from left to right. The 
desired matrix Q is, therefore, given by 


1 —-3 0] [1 0 -2] fi 0 0 a 
a=-|0 1 0lxlo 1 o;xjo 1 —]=]0 1 —4] [178] 
0 o1| [0 o 2) 1o-0 1) {0 -0 1 


Fourth step _ Fifth step Sixth step 
The transformation expressed by Eq. 176 reads 


1 oO] f1 3 2] f1 -3 —F] [1 0 0 
-4 1 0|xl4 2 6|x}o 1 -#]=|0 ~10 0 [179] 
4-4 1] [3 1 7] lo o- 1) [oO 0 48 


The determinant A is equal to the product of the diagonal elements. 
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It is obvious that, by means of elementary transformations of the type 
Tz, this resultant diagonal matrix may be made to go. over into the 
unit matrix of like order. 

It is also possible to transform a matrix @ to the diagonal form by pre- 
multiplication or postmultiplication alone. For example, after the matrix 
is reduced to the triangular form in the manner described above, one has 


1 00 1 3.2 i “3 2 
x@=|-4 1 0|x{/4 2 6|=]0 -10 -2] [180] 
: oo ae 317} |o o 3 


The elements above the principal diagonal in this triangular matrix may 
now be reduced to zero by continuing the process of premultiplication 
with transformation matrices of the same type as the ones contained in 
Y, Eq. 177. Thus the fourth step may be the following: 


10 —2 1 3 2 1 3 0 
01 o|{x{lo -10 —2}=]0 -10 -2] [181] 
00 1 0 o 4}; Lo o 3 


The fifth step becomes 


10 0 1 3 «60 1 3 0 
0 1 318/x/O0 —10 -2}=]0 —10 0 [182] 
00 1 0 o 42 0 0 43 


and the sixth and final step reads 


1 3 0) fi 3 0 1 0 0 
0 1 o|xlo -10 0 |=jo -10 0 [183] 
oo 1} lo o wl] jo o 


The resultant transformation matrix combining the last three steps 
becomes 


1 3; 0 10 0 10 —7% 1% -i% 
2=10 1 O}]x;O0 1 +8 |xj0 1 0 |=|0 1 49] [184] 
00 1 00 1 0 0 1 0 0 1 
Sixth step Fifth step Fourth step 


which is, of course, different from the matrix 178, although not unlike 
this matrix in form. The complete transformation is accomplished by 
premultiplication of @ by the resultant transformation matrix: 


1 ~ —-i 1 00 
R=Q2XF=10 1 481x}-4 1 0 
0 0 1 h -4 1 
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In place of the transformation expressed by Eq. 176, one now has 
Rx@=D [186] 


It should be clear that a similar procedure, involving postmultiplication 
only, can also be used to accomplish the same result. The reader may 
carry out the detailed steps as an exercise. 

In the above numerical example, the rank of the given matrix is equal 
to its order; that is, the matrix is nonsingular. When the rank of @ is less 
than its order, some of the diagonal elements in D are found to be zero, 
but the method of transformation given in Eq. 176 is still applicable. 

From the definition of the rank of a square matrix (which is the same as 
the rank of its determinant) it is readily seen that if the rank of a matrix 
@ of order 1 is r = (nw — ), then p of the diagonal elements in D be- 
come zero. It is then possible to arrange the remaining elements so that 
® has.the appearance 


dy, 0 O 0 
0 dx 0 0 
D=|0 O +++ dy +++ O [187] 
0 O--- 0 O-:--0 
0 0 0 
By a further transformation, the following form may be obtained: 
100 0 
0 1 0 0 
oer ee ee ewe eee ewe eee 4 TOWS # 
SSa0 Oecd axe [188] 
00 --- 0O-:-0 
0 0 0 


Since the matrix D is symmetric, it may be desirable to find & by a 
transformation of the form & = ?,DP. The matrix will not be real, 
however, if any of the diagonal elements in D are negative. The form of 
& given by Eq. 188 may nevertheless be obtained using only real matrices 
either by pre- or postmultiplication alone, or by a transformation like 
Eq. 176 in which # and 2 need not be specially related. 

This diagonal matrix (in which the first r diagonal elements are unity 
and the rest zero) is referred to as the canonical form of the square matrix 
@ of rank r. The diagonal and the canonical forms of a matrix obviously 
place its rank in evidence. 

A square matrix whose rank is less than its order is said to be de- 
generate. If the rank is r = (m — p), then p is spoken of as the degree of 
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degeneracy of the matrix or also as the nullity. The latter term is evi- 
dently suggested by the forms given by Eqs. 187 and 188. 

When the matrix @ is symmetrical, the matrix # which enters the 
transformation to the diagonal according to the process indicated in 
Eq. 176 can evidently be equal to the transpose of 2,* so that this equa- 
tion may be written 


92xXxaxQ2Q=D [189] 


The matrix 2 can have a variety of forms, depending upon the values 
of the diagonal elements appearing in 9. For certain values, called the 
characteristic values or also the latent roots of the matrix @, the trans- 
formation matrix 2 in Eq. 189 becomes orthogonal. This orthogonal 
transformation is discussed in detail in the immediately following chap- 
ters, where it is also shown that the corresponding values of the diagonal 
elements in D are the roots of the so-called characteristic equation, 


(a11—A). Qjg  *t* Gin 
G21 (da2—d) +++ Gan =0 [190] 
ani ang ° (dnn—) 


The left-hand side of this characteristic equation, which is formed by 
setting the determinant of the matrix @ — \%U equal to zero, is evidently 
a polynomial in d of the mth degree. It is clear that the constant term in 
this equation (the term without \) is equal to the determinant of a. 
Moreover, the coefficient of the highest-power term is evidently (—1)". 
If, then, Eq. 190 is divided through by (—1)”, and if the m roots are 
1, Ae, °° * An, it follows from the theory of algebraic equations that 


A=hy Xo Kes Kn [191] 


The determinant of 9D is, of course, also given by the product of the roots 
M1, Ae) ° +: Any these being the diagonal elements in D. This result checks 
with the fact that the determinant of an orthogonal matrix is +1 (see 
Eq. 86), so that the determinant of D in Eq. 189 is equal to the determi- 
nant of @.’One thus recognizes the invariance of the determinant of a 
matrix to an orthogonal transformation of the form of Eq. 189. 


*It is significant that is not necessarily the transpose of Q when @ is symmetrical. Thus 
if the first operation upon the rows of (f effects an interchange in any two rows, the matrix 
encountered at the next step is no longer symmetrical. Symmetry can, of course, be restored 
by subsequently performing a similar operation upon the corresponding columns, but such 
a step is not necessary, and if it is not taken the matrices and Q ultimately obtained will 
obviously not be each other’s transpose. 
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11. ADDITIONAL METHODS FOR OBTAINING THE INVERSE OF A MATRIX 


Having determined the matrices and 2 which accomplish the 
reduction of a matrix @ to its equivalent diagonal form, according to 
Eq. 176, one may proceed to find the inverse of @ by means of the follow- 
ing reasoning. The first step is to form the inverse of both sides of Eq. 
176, making use of the property expressed by Eq. 100. This gives 


9 Sa Ke D [192] 
Premultiplying by 2 and postmultiplying by ?, one has 
@t=2xD' xe [193] 


Inasmuch as the matrices 2 and # are already known, and the determina- 
tion of D7! according to Eq. 67 is a relatively simple task, the formation 

- of @! by means of the result expressed by Eq. 193 involves only a nominal 
amount of additional computation. For the numerical example given in 
the preceding article (see Eqs. 177, 178, and 179) the Eq. 193 yields 


di. 2237 228 1 0 0 1 00 
@i=|o0 1 -#/x]o -+% o|]x]-4 10 
0 0 1 0 0 Fohe4 


—4. 95 —7 
=7,, 5 -05 -1]| [194] 
1-4 5 


Alternatively one may begin with Eq. 186 and form its inverse, which 
reads 
Ct xR = a" [195] 


Postmultiplication by R again gives the desired result, namely 
Qt1=DIxKR [196] 


For the same numerical problem as discussed above (see Eq. 185) this 
reads 


» If @ is reduced to the diagonal form by postmultiplication alone, that 
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is, if one has found a matrix 5 such that 


@xs=9 [198] 
then a similar manipulation as given above shows that 
@Qt=85x QD"! [199] 


Another method of matrix inversion, which in its essential steps utilizes 
the reasoning involved in the formation of the matrix # in Eq. 177, is 
found to be exceptionally effective in numerical work.* The procedure is 
best described by considering the equivalent problem of solving a set of 
” simultaneous equations involving m unknowns, such as 


Gy1%1 + Aye%_ +++ + + Ginkn = M 
Gg1X1 + AgeXg + +++ + Gonkn = Yo [200] 
OniXy + an2%2 fee + Annkn = Yn 


Any other set of equations which are linear combinations of these 
(called an equivalent set of equations) clearly has the same solutions. 
Since only the values of the coefficients a,, and those of the quantities 
1, +++ Yq are significant in the formation of such linear combinations, it 
is expedient to carry out the contemplated manipulations upon the matrix 


Qj, AQqs'* Gin NM 
Qo, G22 °** Aon Ye [201] 
Qny Ono °° * Gin Ya 


which is the matrix @ of the set of Eqs. 200 with the quantities y, --- Yn 
included as an additional column. This resultant matrix involving n 
rows and ” + 1 columns is referred to as the augmented matrix correspond- 
ing to the Eqs. 200. 

In order to simplify the notation in the following discussion it. is 
effective to write the symbols @1,n41, d2,n41, °** @n,n41, in place of 1, 
Yo, ** * Yn, 80 that the augmented matrix assumes the form 


Gy, G2 °** An A,n+1 
do, Ge2 *** Gan Gea,n41 [202] 
Qny On2 °° * Onn Fn 


The series of manipulations which are described presently, have for their 
objective the derivation of an equivalent set of equations having an 
augmented matrix in which all the elements of the principal diagonal 

*This method is described in a paper by Prescott D. Crout entitled: “ A Short Method for ~ 


Evaluating. Determinants and Solving Systems of Linear Equations with Real or Complex 
Coefficients,” A.I.E.E. Transactions, Vol. 60, 1941, pp. 1235-1240. 
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are unity and all those below this diagonal are zero. It is readily appreci- 
ated that once this special set of equivalent equations is determined, the 
desired solutions may be written down by inspection. 

The first step is to divide the elements of the first row in the matrix 
202 by ai1 so as to obtain 


1 %2 ds | Q n+l 
ay, 1 a1 
203 
G21 422 43 @2,n+1 [ ] 
“Qn1 Ono Ong °° * Onnqi 


This step requires that a,; have a nonzero value. If this condition is not 
fulfilled it can always be met by a rearrangement of the original equations. 
The a;-multiplied elements of the first row are now subtracted from the 
corresponding elements of the second row, giving 


1 is | tant 
1 Ai a1 
O bee bag +++ bangs [204] 


Cr 


Qnr Gng Ong *** On,n+i 


in which 
421412 
bee = deg — —— 
a1 
421413 
beg = d23 — —— [205] 
a1 
219% ,n4+1 
be n-41 = don 
ayy 


Next the a3;-multiplied elements of the first row are subtracted from the 
corresponding elements of the third row. This leaves the augmented 
matrix in the form 


1 2 %3 | | Fn 
a1 Ay ‘1 


0 boo bes vane benti [206] 


ee 
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with 
_ @31412 
bse = 32 
a1 
431413 
bsg = @33 — —_ [207] 
a1 
- 43194 n4t 
bs nt = Q3.n41 — 
a1 


The procedure is continued until the matrix assumes the form 


1 M2 93 | Ant 
1 1 a1 
0 bes bos Spe be ni [208] 
0 bso bag bg n4t 
0 bn baz Bani 


A single formula for the coefficients b,, is recognized to be given by 


bez = Usk — weitte [209] 
a1 
Now the elements of the second row are divided by boo. (A rearrange- 
ment of the last 2 — 1 rows will be required if b22 happens to be zero.) 
One then has 


, a2 Gs | Ant 
Q1 G1 @41 


boo gg [210] 


ey 


The portion of this matrix exclusive of the first row and column is now 
dealt with in a manner which is identical with that just described for the 
transformation of the matrix 203 into the form of 208. In the first step 
the b3o-multiplied elements of the second row are subtracted from the 
corresponding ones of the third row. Next the b42-multiplied elements of 
the second row are subtracted from the corresponding elements of the 
fourth row, and so on. This set of operations finally yields the matrix in 
the form 
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y oe ise nt 
ay G1 Gy 11 
bog Ob 6 
0 1 a a eda 0 a 
22 U2 22 [211] 
0 C3334 C3 ,n+1 
0 Cag Cag C4,n4+1 
0 0 Cn3 Ong 2 * Cn ntl 
in which the coefficients c,, are given by the formula 
Bsob 
Cok = Dek — as [212] 
22. 


The portion of this matrix involving the coefficients ¢,;, is now treated 
in the same manner as described for the original matrix 202, and as 
repeated in the manipulation of that portion of the matrix 208 involving 
the coefficients b,x. The resulting matrix then assumes the form 


1 M12 %3 %4 | Unt 
410 G1 G1 a1 
bos bog be nti 
Oe, a eee ee 
bee baa boo 
oo 1 & €3,nt1 [213] 
633 €33 
0 0 0 44 ds n+l 
0 =O O dna dy n+ 
me dex = Cop — EOE [214] 
€33 


The continuation of this process is obvious. In the case n = 4, for 
example, one arrives at the desired form 


1 m2 M3 M4 As 
Qyy G1 Ay Ay 


1 ba boa Bo 


0 aia 
bee bee boo [215] 

Ot 4 At 

C33 (C33 

0001 % 


das 
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The solutions to the equivalent equations having this augmented matrix 
are easily recognized. They are given by 


_ U5 
= 
das 
_ 5  €34 i 
3 = a 
C33 C33 
[216] 
= bos bos bes 
se a Pa a 
22 22 22 
i 415 414 “13 a Xo 
ao ts Soe 
ay ayy a1 ay 


In order to systematize the computational procedure it is expedient 
to record the following “ auxiliary matrix ”: 
G2 3 G4 15 
Oi ee a a 
Qyy. G11 Ay Ay 


bos bog bas 


a2, boo 
beg dae bee 
b C34 «= 85 [217] 
a31 32 033 a re 
33 3 
d45 


From the recursion formulas 209, 212, 214 which may be rewritten in 
the form 


: Qs191k 
bex = as, — 
411 
As1%k bsador 
a1 22 
43101% ODgobon  CoaCar 
dst — as ja ORAL ET ce" ge eee an ae eG 


a1 bee C33 


it becomes clear that the elements in the auxiliary matrix can be cal- 
culated and recorded in alphabetical sequence, with the recording of the 
elements of any column preceding that of the elements of the correspond- 
ing row. 
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Although only the elements above the principal diagonal are needed 
in the final computation of the unknowns, as is shown by the Eggs. 216, 
it is necessary to calculate and record also the elements on and below the 
principal diagonal since their values are needed in the sequence of com- 
putations which determine the auxiliary matrix as a whole. This fact is 
readily appreciated from an examination of the recursion formulas 218, 
or better still through an application of them to a numerical example: | 

Since only those results are recorded which are needed in subsequent 
calculations, the computational scheme is orderly and compact. It is 
also significant to mention that the calculation of any one of the elements 
of the auxiliary matrix, in the sequence described above, may be accom- 
plished through a single continuous operation on a modern computing 
machine. Thus, from the standpoint of minimizing the required opera- 
tions, the present procedure for solving simultaneous equations is excel- 
lently adapted to the available computing facilities. 

Regarding the problem of inverting the matrix of Eqs. 200 it is helpful 
to make the following preliminary observations. Suppose, in this system 
of equations, that all but one of the quantities ,.- - - y, are zero, and that 
the nonzero one, which may be any ¥,;, has the value unity. The par- 
' ticular solution corresponding to this choice of y-values may conveniently 
be denoted by 214, X24, °° * %ng. Since the index k may have any value 
' from 1 to ”, there are such sets of particular solutions. It should now be | 
recognized that the set of x-values for & = 1, when arranged in a column 
in which the order of numerical subscripts reads from top to bottom, 
represents the first column in the desired inverse matrix; the set for 
k = 2 represents the second column in the inverse matrix, and so forth. 
Thus the complete array of ? quantities x, are the elements of the 
inverse matrix, with the customary denotation of the indexes s and &. - 

The process of determining the inverse matrix is thus seen to require 
the evaluation of ” sets of simultaneous solutions. This process does not, 
however, require ~ times the computational labor involved in obtaining 
a single solution inasmuch as only the last column in the auxiliary matrix 
differs in the procedure for obtaining each solution. The work may be 
arranged so that all the solutions are evaluated in one continuous 
sequence of operations by recording an auxiliary matrix with » varieties 
of “‘ last ” columns written side by side. The details of such a procedure 
are best illustrated by means of a numerical example. 

Choosing the same matrix as is used in the numerical illustrations of the 
preceding article, one begins by considering the following augmented 


matrix 
yi 3 2 1 0 0 
426010 [219] 
3170041 
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in which the last three columns represent three varieties of the column of 
y-values as given in the matrix 201. The first of these three columns 
represents the values y,; = 1, ye = y3 = 0; for the second of the last three 
columns, yo = 1, ¥, = y3 = 0; and for the third, yg = 1, ¥1 = yo = 0. 
Regarding all the elements of this augmented matrix as coefficients a,, 
in the recursion formulas 218, one obtains the auxiliary matrix 


D3. 2 20:0 
Ai OG ee ee, I. [220] 
Ss 7B Nas —is v3 


Only the element values above and to the right of the elements on the 
principal diagonal (beyond the dotted line) are now used to determine 
the values of the quantities x... The fourth, fifth, and sixth columns are 
regarded as “ last ”’ columns in computing values of x, for k = 1, 2, 3 
respectively. For k = 1, one finds 


pk 
31 = 73 
%1 = 0.4 — 0.2%31 = al [221] 
13 
= 1 — 2x3; — =- > 
*11 431 — 3x21 13 
For k = 2, 
es 
a a 
1 0.5 
a2 = — 797 0.2%32 = — B [222] 
9.5 
= 0 — 2x32 — =—— 
*12 X32 — 3x22 13 
and for k = 3, 
eg 5 
33 = 73 
X23 = 0 - 0.2x33 = [223] 


13 


X13 = 0 — 24x33 — 3x03 = ~B 
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Hence the desired inverse matrix is 
11 Xie Xy3 1 —4 95 -—7 


Qo = %q1 X%q2 X93 = 3 5§ -05 —-1 [224] 
X31 X32 %a3 1 —4 5 


which agrees with the results given by Eqs. 194 and 197. 


PROBLEMS 
1. Write the following as a matrix equation: 
P = ert, + ente + esis + Cate 
_For example, 
51=2X5-3X7+6xX9+4+4xX2 
2. Write the four scalar equations: 
1 = 91 v2 = y2 %3 = ¥3 X4= 9a 


as a single matrix equation in at least four different ways. 
3. Express as matrix equations the relations: 


af aA, 
te See A Oe 
fors = 1,2,3 and = 1, 2, 3. 
4. Evaluate: 
14 -—2 3 6 4 
L 6} +L 1 o1-[-2 7] 
and 


1 6 2 2 9 -6 
6] 4 3 -5|-4] 4 -7 10 
7-9 1 20 8 -—5 


5. Carry out the following triple product in two ways and note the difference in 
total numbers of multiplications and additions: 


1 3 
a eh 6 
a7 es ee ae ee 
-4 10 62 


and 


7 2 -—10 11 1 
—8 4 3ix]1 1 1 
1-6 #7 111 


72 MATRICES (Ch. IT 


7. Given: 
1 0 0 cos8B 0 ~—sin8 cosy siny 0 
a=|0 cosa sina b= 0 1 0 c=]|-siny cosy 0 
0 -—sina@ cosa sinB 0 cos 8 0 0 1 


Show that these are orthogonal matrices. Also form the products ab, ac, bc, and abc, 
and show that they too are orthogonal. Note that ab # ba, ac = ca, etc. 


8. In the evaluation of the following multiple product: 


2 1 
| -]xe 6 aix|-2] x0 2] 
5 5 


determine that association which requires the least number of multiplications and 
additions. 
9. Evaluate: 


2 -6 4 a0o0 a0o0 2 -6 4 
3 9 7|xI{O 46 0 and 0 6 O}XK}3 9 7 
1 5 8 00 «¢ 00 « 1 5 8 


10. Determine the most convenient ways of evaluating the following multiple 
products: 
1 - 2 6 6 1 5 
E il - [-3 7] x[s 3] x[7] 
1- 2 6 6 1 
we ax[$ “4]x[3 o]xLe 3] 

11. Evaluate: 

1 2]? 1 273 1 2]-1 1 2|7-? 

4 3 4 3 ‘4 3 4 3 


100 1 0 0O 
0 2 O0|X/]0 05 0 
00 5 0 0 02 


12. Using the indicated partitioning, find the inverse of the following product: 


26:0 0 —1 2:0 O 
17!'0 0 3 -4'0 0 
wwe een ee xX weee eee ew = HH = 
00,5 4 0 o;,5 +2 
00!3 1 0 O!7 -6 


13. Given the nonsingular matrix: 


1 O -2 
@=)4 3 #5 
0-6 8 


find the adjoint @* and evaluate the products @ x @* and @* x @. Repeat with the 


matrix: 
1 3 
=| 2 -s|x[t -2 3 
—4 6 


Ch.. 17] PROBLEMS 73 


Discuss the results, showing why their particular forms should be expected. 
14, Using the matrices @ of the preceding problem, determine matrices P and 2 
which will yield diagonal forms in each of the following applicable relations: 


@Px@xQ=D (ii) PxA=D iii) @xQ=aD 


In terms of these results, find the inverse of the nonsingular matrix of Prob. 13. 
15. Given: 


1 20 -!1 

4 -3 2 1 

@ = 6 21 4 
—3 0 5 7 


find @-!, using the method of partitioning discussed in Art. 7, and again by the 
method involving the “‘ auxiliary matrix ” as discussed in Art. 11. 


16. Prove directly from the definitions of the sum and product of two matrices that 
@x(Bxl) =(@ x B) x€ 


- and 


@x(B+C) =(@ x B) + (A xC) 
17. Compute the third power of the matrix 
f= 2° 4 
. a4, 3" 20% se3 
@ = 2 0 9 -6 
1-3 -6 19 


and check the value 13824 of the determinant of the resultant matrix. 


18. If @ is a nonsingular matrix, and the expression @— is interpreted as being 
equivalent to (@-1)*, show that @-" is unique and, in particular, that 


1 0-:-0 
Qo = 0 1-:-0 


ee ey 


19. The row matrix [1 ye ys +++ yn] may be regarded as equivalent to a square 
matrix having these elements in its first row and zeros in the remaining rows. With 
this interpretation of the row matrix, show that 


{1 ye yas yn]™=[1 yo yar? Yn] 
and more generally that 
[yr votes an)™ =n" D1 Yar?s Mal 
Analogously, find for the column matrix that 
yi” ya 
y2 Ye 
: = yt | 


Yn Yn 


74 MATRICES (Ch. I 


20. The expression 
P(X) = Qo + GX + AC? fee: + @,9C" 


in which Qo, Q@,,--- @, are matrix coefficients and SC is a matrix playing the role of a 
variable, may be regarded as a matrix polynomial. A less general form of matric 
polynomial is 

Q(X) = poU + pil + pr? +--+ + pum 
in which fo, p1,+** pn ate scalar coefficients and °U is a unit matrix having the same 


order as 2. 
(a) Show that the equation 


ax +B =0 


in which 9C, @, and & are matrices, has one solution only if @ is nonsingular. 
(b) In the equation 


A,M" + Ay 1H +++» + GiXK + Ao = 0 
let 
2 =[a1 xe --+ xn] 
the coefficients @ being matrices. Show that a solution is given by the expression 
X = — (Qyxy 1! + Aaa”? +--+ +1)" 


provided the matrix in the parenthesis is nonsingular. 
(c) Consider the polynomial 


Y= X24 8KN+C 


~~ = es zt 
%21 ae 
Show that a solution to the equation Y = 0 (null matrix) exists only if the following 
conditions are fulfilled 
X11? + Xy9%e1 + biivis + biexer = —Cis 
(11 + xe2 + d11)x12 + die%22 = ~—Cig 
(411 + %22 + baa)wa1 + bei. = —Car 


with matrix coefficients, and 


X07 + XoiX19 + beixie + daoxee = —Cag 
and that the elements x11, 22, %12, ¥21 are independent only if 
211 + b11 %o1 X42 + dis 0 
om #12 #11 + %e2 + dis 0 #12 + die #0 
xa1 + bey 0 X11 + X2e + dee xr 
0 %o1 + bai X12 2x22 + bee 


21. @ and & are two square matrices of like order whose product is zero, that is, 
Q@ x B = 0 in which 0 is a null matrix of the same order as @ or B. Let @ be given 
and & unknown. Write the complete set of equations whose solution (if it exists) 
yields the elements 5,, in terms of the elements a,,. If @ has the order » and the rank 
mn, what is the solution? Discuss the possibility and form of solutions if @ has the 
tank  — 1, and more generally if @ has the rank n — p. 
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Solve the specific equations 
1 S51 1 5 1 
3 15 O|xB=0 and BxI{3 15 Of =0 
4 20 1 4 20 1 


22. Given the equation 
XC? + 3X — 10U =0 
in which 9% is a square matrix and “U is the unit matrix of like order. Find solutions 


of the form x,°U in which x, is a scalar and show that one may write the matrix 
polynomial in factored form 


(Xx ~ x1 U) x (X _ xe) = 2 + 390 — 10% 
Check this result for the particular value 


23. If C is a matrix with complex elements ci, = dix + jb, and if @ has the 
conjugate elements, show that although C +C is real and ec = C is imaginary, it 
does not follow in general that C x C is real nor that C x C = @ x@. 


CHAPTERIII 


Linear Transformations 


1. VECTOR SETS 


The primary object of this chapter is to offer a means of visualization 
for the purely algebraic reasoning underlying the essential ideas presented 
in Chs. I and II. This object is accomplished by giving a geometrical 
interpretation to the linear transformation and the matrix which charac- 
terizes it. There are, of course, a variety of possible forms which such a 
geometrical interpretation may take. The point of view given here is 
chosen for its utility in the particular applications discussed in the 
reference volumes. 

In order to keep the discussion fairly general initially, the matrix is 
assumed to have m rows and x columns instead of being square. It is 
written 


41 2 Qin 
Q = doy G22 *°° Gan [1] 
Gmi Om2*** Inn 


If the elements of each row are considered to be the components of a 
space vector with reference to a chosen rectangular Cartesian co-ordinate 
system, the matrix assumes an easily visualized geometrical characteriza- 
tion. It represents a set of vectors all emanating from a common origin so 
as to form a cluster. Since the number of components of each vector is in 
general greater than three, the process of visualization lacks the physical 
clarity which ordinarily accompanies the conception of a space vector. 
A purely mathematical extension of the idea of ordinary space to the 
conception of a space of many dimensions involves, however, only a pass- 
ing mental hazard. 

The co-ordinate axes in this many-dimensional space are for conven- 
ience numbered 1, 2, 3, - - - etc., instead of being lettered x, y, z, as in the 
usual three-dimensional case. Thus the elements 11, @12,--- a1, of the 
first row of @, for example, are looked upon as the components (projec- 
tions) of a vector a; with reference to the rectangular co-ordinate axes 
1, 2,---# in an -dimensional space. In like manner, the elements @g1, 
@o2,°** Gan are looked upon as the components of a vector dp. Finally, 
the elements of the mth row define a vector a,. The cluster of vectors 
1, @g, - - * @m is spoken of as the vector set of @. 

It is possible for the matrix @ to be characterized by an alternative 
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vector set, namely the m vectors a,‘, a2',---a@,' whose components are 
the elements taken by columns instead of by rows. In this case, the space 
occupied by the vectors is m-dimensional, because each vector now has m 
components. This cluster of vectors is called the transposed vector set of 
G, since it is the vector set of the transpose of @. 


2. LINEAR DEPENDENCE AND INDEPENDENCE; THE RANK OF A VECTOR 
SET 


If one or more of the vectors in a set can be expressed as a linear vector 
addition of the remaining vectors multiplied by suitable positive or 
negative numerical factors, the set is said to be linearly dependent. If such 
an expression is not possible, the vectors are linearly independent. For a 
set of m vectors the possible existence of either condition may be ex- 
pressed mathematically by the statement that if one can not find m 


numbers 71, Y2,-*- Ym (excluding the choice 7, = yz =--+ = Ym = 0) 
for which the following vector equation holds 
Y101 + Yod2 +--+ + ¥m@m = 0 [2] 


the vectors form an independent (otherwise a dependent) set. 

In three-dimensional space, any three vectors which do not lie in a 
plane form an independent set. If the three vectors do lie in a plane, one 
or more of them evidently can be expressed as a vector sum of the others 
multiplied by suitable factors. If in this same space the number of vectors 
is greater than three, the set must necessarily be dependent. In order for 
the vectors to form an independent set, it is necessary that their number 
shall not exceed three, although this condition alone is clearly not 
sufficient. ‘ 

In an n-dimensional space, the largest number of independent vectors 
is n, but a dependent set may contain any number of vectors. The num- 
ber of available dimensions in such a space is . The vectors occupying 
this space may have such relative orientations that fewer than  dimen- 
sions are actually consumed or utilized. In three-dimensional space, for 
example, a. set of vectors which lie in a plane consumes only two dimen- 
sions, and if they lie in a line they utilize only one of the available three 
dimensions. 

It is clear that if the number of dimensions utilized is less than the 
number of vectors in the set, the vectors are linearly dependent; but if the 
number of utilized dimensions equals the number of vectors (it obviously 
cannot be greater than the number of vectors in the set), the vectors are 
linearly independent. If the number of vectors is equal to or less than the 
number of available dimensions, both cases can occur; but if the number 
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of vectors is larger than the number of available dimensions, the only 
possibility is for the vector set to be a dependent one. 

The number of dimensions actually consumed by a vector set is by 
definition equal to the rank of that set. The vector set of @, Eq. 1, con- 
sists of m vectors in an m-dimensional space. The rank of this set can at 
most be equal to 2. If m > n, the vector set of @ must be dependent, for 
the rank is then necessarily less than the number of vectors. In order to 
determine the rank, one may begin by considering consecutively all 
groups of 2 vectors which can be selected from the given set (the number 
of such groups equals the number of combinations of m things taken 7 at a 
time) and examining them to determine whether a linear relation of the 
form given by Eq. 2 can be found to exist. If one or more groups can be 
found for which such a relation does not exist, the rank is ; if it exists for 
all groups, the rank is less than ”, and one must proceed to investigate 
in the same fashion all groups of (#—1) vectors which can be selected 
from the given set. The largest number of independent vectors which can 
eventually be selected in this manner is equal to the rank of the vector 
set. 

In the light of the discussion in Art. 10, Ch. I, regarding the rank ofa 
determinant, the procedure just described may be seen to be equivalent 
to the following statements. From the matrix @ (with m > n), all the 
n-rowed determinants are selected. There are as many of these as there 
are m things taken » at a time, and they correspond to the groups of 7 
vectors previously mentioned. The highest rank to be found among these 
determinants is the rank of a vector set of @. 

In order to appreciate the truth of the latter assertion, it is necessary 
first to observe that the independence of (m — 1) vectors in an m-dimen- 
sional space may be established by consideration of all the “ projections ” 
of these vectors onto the » co-ordinate “ planes,” comprising (7 — 1) 
dimensions each. Such a “ projection ” is carried out by simply allowing 
the components of all the vectors along one particular axis of the space 
to go to zero, or be disregarded. Consideration of the problem of two 
vectors in three dimensions will lead to the recognition that a necessary 
and sufficient condition for independence of the (m — 1) vectors is that at 
least one of these » projected sets should be independent. Similarly, to 
insure dependence of the set, all the 7 projected sets must be dependent. 
In an identical manner the independence of (m — p) vectors in an n- 
dimensional space is determined by the independence of at least one of the 
group of all possible (# — p)-dimensiona. “projections” of the set, 
formed by striking out » components at a time from all the vectors. If, 
and only if, all such projected sets are dependent, the set itself is de- 
pendent. It is then pointed out that all possible groups of (n — p) vectors 
which can be formed from the original group of m may be regarded as 
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selected by choosing all possible groups of (x — p) vectors from each of 
the groups of vectors originally selected. The process of ascertaining the 
highest rank among the » determinants first chosen is then seen to be 
exactly equivalent to finding the largest number of independent vectors 
in the set by a method of successive projections. 

In the transposed vector set of @ the number of vectors is ” and the 
number of available dimensions is m. Still assuming m > n, the rank can 
at most be ”, and is equal to the largest number of independent vectors 
which can be formed from the given set. The dependence or independence 
of a selected group of vectors is ascertained through noting whether or 
not a relation of the form given by Eq. 2 exists. This process of investiga- 
tion is, however, equivalent to determining the highest rank to be found 
among all determinants formed from the transpose of @ by selecting n 
columns in all possible combinations. These determinants are simply the 
transposed ones of those referred to in the preceding paragraph. Since the 
rank of a determinant is the same as that of its transpose, it therefore 
follows that the vector set of the transpose of @ has the same rank as the 
vector set of @. 

The same conclusion holds when m < n.and m = n. When m # n, the 
rank of the vector set or its transpose is at most equal to the smaller of 
the numbers m and 2. When m = n, which is the case of greatest practical 
importance, the definition of the rank of the vector sets agrees with that 
already given for the rank of a square matrix and its determinant.* 


3.. VECTOR SIGNIFICANCE OF A LINEAR TRANSFORMATION 


The ideas developed in the preceding articles suggest a useful geometri- 
cal interpretation for the linear transformation 


041%y +++ + Onkn = VN 
Rares ee a kee [3] 
QniX1 Sie annkn = Yn 
with the square matrix 
M1 °° * Gin 
6 igen Pree errr [4] 
Gant ++ * Ann 


In the -dimensional space occupied by the vector set of @, two addi- 
tional vectors « and y, with components 1, %2,-°++ %n and V1, Ye, - ++ Yn 
respectively, are defined. The left-hand sides of the equations in the set 3 

*In connection with the discussion of this article it may be of interest also to refer:to Art. 6, 
Ch. IV, for an alternative method for establishing the linear dependence or independence of 
a given vector set. 
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are then recognized to be the scalar products of the vectors 41, dz, +--+ @n 
respectively and the vector x. 

The scalar product (cf. Art. 2, Ch. V) is expressible either as the sum of 
the products of corresponding components of two vectors or as the product 
of their lengths and the cosine of the angle between them. If the given 
vectors are at right angles to each other, their scalar product is evidently 
zero. 

With the scalar product denoted by a dot placed between the symbols 
for the vectors (Gibbs’s notation), Eqs. 3 may be written in the alterna- 
tive compact form 


Yt N 
ag:X% = Vo [5] 
Qn*% = Vn 


These equations are said to transform a vector x into a vector y, the 
components of the latter being the scalar products of the vector x with 
those in the vector set of @. 

Any vector x with given length and direction is transformed into 
another vector y by means of Eqs. 5, the mechanism of the transforma- 
tion being determined by the cluster of vectors a1, d2, ++ @n. As the 
length and direction of the vector x are varied at will, the length and 
direction of the vector y vary in a corresponding manner. The vectors 
x and y may be visualized as rods emanating from a box which contains 
the mechanism of the transformation as characterized by the matrix @ 
or its vector set. As the rod representing x is pulled out or pushed in and’ 
its direction changed, the mechanism in the box causes the rod represent- 
ing y to lengthen or shorten and change its direction in a corresponding 
manner. 

For the moment, the mechanism in the box may be assumed to be so 
constituted that each length and direction of the rod representing x 
uniquely determines a length and direction for the rod representing y. 
This is evidently the case if the matrix @ is nonsingular. Then the trans- 
formation is reversible; that is, the rod representing y may be moved 
about at will, thus causing the mechanism in the box to produce corre- 
sponding lengths and directions for the rod x. A given point in space for 
the tip of the rod x determines one (and only one) point for the tip of the 
rod y, the corresponding points being independent of whether the rod x 
is pushed and the rod y caused to follow or vice versa.* 


*In particular it may be of interest to note that if the vector x is moved about in a plane, 
the vector y remains in a plane also; and if the tip of the x-vector follows a straight line, the 
tip of the y-vector likewise follows a straight line. These matters are treated in greater detail 
in the discussion immediately following Eq. 168 in Art. 10. 
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Viewed in this mechanical manner, the transformation 5 and its 
inverse involve the same mechanism. Algebraically, however, the inverse 
transformation 

bdr tt + bine = 11 


eee ar rr [6] 
BniV1 +++ + OnnYn = Xn 
written in vector form 
b-y= 
eee (7 
bn ‘Y= Xn 
involves the vectors 61, 62, - + - bn, which are the vector set of the inverse 
matrix 
bu : bin 
Gea Bias iy eensens [3] 
Ont Onn 


Since B X @ equals the unit matrix, and is formed through multiplying 

the rows of & by the columns of @, it follows that the scalar product 
1 fort =k 
bis an! =4 0 forixk [9] 

The vectors a‘, a2’, -++a,' are the transposed vector set of @. More 
specifically, the relations 9 state that 6,, for example, stands at right 
angles to the vectors a2’, a3‘, ---d,‘, whereas its scalar product with 
a,‘ is unity; b2 stands at ‘Tight angles to the vectors a’, a3’, - - - ay’, and its 
scalar product with a2‘ is unity, and so forth. This situation offers an 
interesting geometrical view of the relation betwee a nonsingular matrix 
and its inverse. 

If the square matrix @ is symmetrical, 


a= 
ae" = ao [10] 
Qn’ = On 


Then the orthogonal relationships just described hold directly for the 
vector set of @ and the vector set of its inverse. 


4. ORTHOGONAL CO-ORDINATE TRANSFORMATIONS; ORTHOGONAL VEC- 
TOR SETS 


It sometimes becomes necessary in dealing with physical problems to 
transform from one co-ordinate system to another. With respect to a 
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given reference system whose rectangular axes are numbered 1, 2, --- x, 
the co-ordinates of a particular point may be denoted correspondingly 
by the quantities x1, %2, : - + %,. In a second rectangular reference system 
whose origin coincides with that of the given one, the axes which are 
labeled 1’, 2’, ---’ have different angular orientations with respect to 
the axes of the original system. This second reference system may be 
thought of as obtained by simply rotating the axes of the given system, 
keeping the origin fixed and maintaining the mutual orthogonality of the 
axes. 

The point in space whose projections on the original set of axes are the 
quantities x1, %2,- - - X, has projections on the axes of the second reference 
system which are conveniently denoted by 2’1, x’2, -+ + %’n. These are the 
co-ordinates of the same point with respect to the second reference 
system. The process of expressing the quantities x’1, x’2,---+ x’, in terms 
of x1, %2, °- - Xn is spoken of as a co-ordinate transformation. 

In order for such a co-ordinate transformation to be carried out, the 
relative orientations of the axes in the original and the new reference 
systems must, of course, be known. Algebraically the directions of the 
new axes with respect to the old ones are expressed by quantities called 
direction cosines. If the cosines of the angles between axis 1’ and the 
axes 1, 2,-- - mare denoted respectively by the coefficients 011, 012, °° - O1n, 


the cosines of the angles between axis 2’ and the axes 1, 2, - - - m respec- 
tively by 021, 022, -- + 02n, and so forth, the array of coefficients in the 
matrix 
011 O12 O1n 
O= 021 022 *** Don [11] 
On1 Ong °** Onn 


‘are the complete set of direction cosines of the new axes with respect to 
the old ones. 

The desired co-ordinate transformation is then expressed by the linear 
set of equations 


, 
011% + O19%2 + +++ + O1nXn = X'y 
: / 

Og1%1 + Oog%e + +++ + Oontn = X'9 [12] 


SS 


OniX1 + On2eX%g + +++ + Onntn = Kn 


In order to appreciate the correctness of this result with regard to its 
general form, and also to clarify the geometrical relations involved, the 
reader may carry through the derivation of this transformation for the 
simple two-dimensional case. 

In the language of the vector interpretation given in the preceding 
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article, Eqs. 12 transform a vector x with the components 41, %2, + ° + Xn 
into a vector x’ with the components 2’;, x9, --- xn. Since the origin 
for both co-ordinate systems is the same and only one point in space is 
involved, x and x’ are the same vector. It has a different direction with 
respect to the new reference axes from that which it has with respect to 
the old axes, but its length must certainly be the same in both reference 
systems. This fact means that 


wy? bag? pve ety? = 201? + ag? + 0+ + 209? [13] 

The right-hand side of this equation may be obtained through mul- 

tiplying Eqs. 12 respectively by x’1, x’9,--- x’, and adding. This process 

leads to a double summation. In order to abbreviate the writing of the 

result, it is effective to use the summation sign. Thus Eq. 13 with 12 sub- 
stituted becomes ' 


x a 7 = x x OgrXrX’ 5 = >> ae? [14] 


s=1 s=1 r=1 r= 
Reversing the order of summation in the double sum yields 
n n n 
x x a) vy = x XyXy [15] 
r= $= r= 


from which it becomes clear that the stipulation expressed by Eq. 13 
leads to the result 


n 
, 
YX Oert’s = Xp {16] 
s=1 
Written out for r = 1, 2, ---, this reads 
, i 7 
0x10 1 + Ogi 2g +e > + OX n = M1 
i / 
012% 1 + On0% 2 +++ + Onet n = Xe [17] 
, a , 
O1n® 1 + OonX¥'g Fees + Onn® n = Xn 


This is the inverse transformation with respect to the one expressed by 
Eqs. 12. Its matrix is seen to be the transpose of 0. A matrix whose 
transpose is equal to its inverse is shown in Art. 6, Ch. II, to have the 
properties expressed by Eq. 78 or more specifically by Eqs. 79 and 80 of 
that chapter, 

The direction cosines of the new set of rectangular co-ordinate axes 
with respect to the old ones, therefore, satisfy the relations 


1 fors=r 
051071. + 0920r2 +++ * + OsnOrn = { 0 Rane ~r [18] 
and 
1 fors=r 
01017 + 02900r + *°* + OnsOnr = { 0 a s#r [19] 
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In terms of the vector set of O, that is, 01, 02, -- + 0, and the transposed 
vector set 01', 02°, - - - 0’, these relations may be expressed by the scalar 
products 
f1 fors=r . 
oor = { 4 for s ¥r [20] 
and 
1 fors=r 
Oe = { 0 fors#r [21] 


It thus becomes clear that the vector set of the matrix belonging to the 
transformation from one rectangular co-ordinate system to another con- 
sists of a cluster of mutually orthogonal vectors of unit length. The same is 
true of the transposed vector set, which also is the vector set of the inverse 
matrix. The reason for designating this kind of matrix as an orthogonal 
one is thus evident. The vectors likewise are said to form an orthogonal 
set. 

The relation 9, which is shown in the previous article to hold for the 
elements of any nonsingular matrix and those of its inverse, becomes 
identical with Eq. 21 when written for the elements of the orthogonal 
matrix and those of its inverse or transpose. 

As pointed out in the discussion of the orthogonal matrix, the product 


Ox0=OxO0T=% [22] 


Since the determinant of the unit matrix U is unity, and the determinant 
of ©; is the same as that of 0, it follows that 


O?=1, or O=H1 [23] 


A closer examination of the geometry involved shows that the value of 
the determinant O is +1 if the directional sequence of the vectors 
01, 02, °* + On corresponds to a right-hand screw rule, and it is —1 if this 
sequence follows a left-hand screw rule. If both the reference axes 1, 
2,--++m, and 1’, 2’, --- mn’ form right-hand or both left-hand systems, 
O = +1; but if one is a right- and the other a left-hand system, O = —1. 

Besides being the matrix of a transformation representing the rotation 
of a rectangular system of reference axes, the orthogonal matrix is 
evidently also the matrix of a transformation for which a vector x and its 
transform yy have the same length. In other words, a linear transformation 
may represent either a change from one co-ordinate system to another or 
the transformation from one point in space to another with respect to the 
same reference system. If the transformation is an orthogonal one, it 
represents a pure rotation about the origin; that is, a rotation either of the 
co-ordinate system for a fixed point, or of an arbitrardfboint in a fixed 
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system. In either case, the axes of the reference system are at right angles 
to each other if the transformation is an orthogonal one. 

Such a rectangular co-ordinate system is commonly called a Cartesian 
system, though Descartes (after whom it is named) also devoted many 
of his studies to systems of co-ordinates in which the axes make oblique 
angles with each other. In a three-dimensional oblique reference system, 
the projection (co-ordinate) of a point on a given axis is found by dropping 
a line to this axis parallel to the plane of the other two axes (in a rectangu- 
lar reference system this line is the perpendicular dropped from the point 
in question). An analogous interpretation applies to oblique systems of 
more dimensions. 

Any linear nonsingular transformation may be regarded as a trans- 
formation of the co-ordinates of a point in a given oblique co-ordinate 
system to those of the same point in another system with coincident 
origin, or it may be regarded as the transformation from one point to 
another in the same oblique co-ordinate system. 

An oblique co-ordinate system is also called an. affine system. The 
transformation from one such system to another or from one point to 
another in the same system is sometimes referred to as a linear affine 
transformation. The orthogonal transformation is a special case of this 
more general type. 


5. TRANSFORMATION TO AN OBLIQUE CO-ORDINATE SYSTEM 


In the transformation from one orthogonal co-ordinate system to 
another, the direction cosines of the axes 1’ with respect to the axes 
1, 2,-+-+m (these are the elements of the first row of the matrix O, Eq. 11) 
may be regarded as the components (projections) of a unit vector lying 
in the axis 1’. This is the vector 0; of the vector set of ©. Similarly the 
vector 02 of this set is seen to be a unit vector emanating from the com- 
mon origin of the two orthogonal co-ordinate systems, coincident in 
direction with axis 2’. The vector set 01, 02, ° - * 0n of O is thus seen to be a 
set of coterminus unit vectors coincident respectively with the orthogonal 
axes of the system 1’, 2’,---’. By a similar line of reasoning, the trans- 
posed vector set 01, 09°, -- - On’ is recognized as a set of coterminus unit 
vectors coincident respectively with the orthogonal axes of the co- 
ordinate system 1, 2,---m. 

This helpful visualization or geometrical interpretation of the co- 
efficients of a transformation matrix may be extended to the nonor- 
thogonal matrix associated with the transformation to an oblique 
(affine) co-ordinate system, but several novel features enter into the 
geometrical interpretations, requiring further detailed discussion. 

From the algebraic point of view, the number of dimensions of the 
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space under consideration is arbitrary, but for facilitating the geometrical 
visualization of the argument, reference is made to the two-dimensional 
Fig. 1. Here there are drawn three co-ordinate systems — one rectangular 
Cartesian system and two oblique systems. The axes of the rectangular 
system are designated by the letters X, and Xp». The first oblique system 
has the axes =, and Zo, and the second oblique system has the axes 2*, 


X, 


Led 
* A 


Fic. 1. The vector x described in equivalent fashion in oblique and rectangular 
co-ordinate systems. 


and =*, which are at right angles to the axes =, and &, respectively of 
the first oblique system. The reason for considering the two oblique 
systems = and %* (with the relative orientations of axes as just described) 
in conjunction with the rectangular system X follows from the discussion 
below. 

For the rectangular co-ordinate system X, a set of unit vectors wy, 
Ug, *** Un is defined which emanate from the common origin O and are 
coincident respectively with the axes 1, 2,--- of this system, and hence 
are mutually at right angles with each other. A vector x from O to P has 
projections on the axes 1, 2, --- ~ which are denoted by x1, x2, --- Xn 
respectively. The vector components of this vector are “1X1, Ug%g,°* + UnXn- 
The vector sum of these components, according to the usual parallelogram 
(or parallelopiped) rule of vector addition, yields the vector x, thus: 


H = UX, + Ughe + +--+ UnXy [24] 
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In order to provide an equivalent representation for the vector x in 
the oblique system %, a set of unit vectors a1, de, - - - dy is defined, which 
emanate from the common origin O and are coincident respectively with 
the axes 1, 2, - - - 2 of this oblique system. The projections of the vector x 
(co-ordinates of the point P) upon these oblique axes are denoted by 
£1, £2, --- & respectively. The vector components of x in the oblique 
system & are, therefore, a), dof, --- @nén, and their vector sum also 
yields the vector x, that is, 


% = ab + oko +--+ + Onén [25] 


Finally, the vector x may also be represented in the second oblique 
system =* by defining for it the set of unit vectors a*), a*2, --- at, 
emanating from the common origin O and coinciding respectively with 
the axes 1, 2, --- ” of this system. The projections of x upon these axes 
are denoted respectively by &1, &2, --- &*n. The vector components. 
of x in the system * are a*,£*,, a*9f*o, -- - a*,&*,, and their vector sum 
yields 


w= aye) + a®ot sy tee + ane, [26] 
The unit vectors 1, 42, + + - 4% are regarded as the vector set of a matrix 


U1 U2 *** Un 
un U ene 
aL is 21 22 Yan [27] 


ee 


Unt Ung *** Unn 


in which the elements are the components of the vectors with respect to 
the rectangular co-ordinate system X. According to the definition of this 
vector set, it follows that this matrix is the unit matrix 


10 0---0 
aT a Beneeeeeee [28] 
0 0 1 


The unit vectors a1, d2, + - - d, of the co-ordinate system & are regarded 
as the vector set of a matrix 


Qi 2 Gin 
a eee a: 

Q= 21 dee Qn [29] 
GQ@ai neg ann 


in which the elements are the components of the vectors with respect to 
the rectangular system X also. For example, @11, @i2, --* @in are the 
projections of the vector a, upon the axes 1, 2, -- - m respectively of the 
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system X; 21, Goo, “+ * Aon are the projections of the vector a2 upon the 
axes 1, 2, --- respectively of the system X, and so forth. 
Similarly the unit vectors a*,, a*2, --- a*, of the co-ordinate system 
&* are regarded as the vector set of a matrix 
a1, a*y2 +++ On 
Qt = a5) a* on +s a on) [30] 
a*ny Ong ann 


in which the elements again are the components of the vectors with 


respect to the rectangular system X. That is, a*11, a* 2, - ++ a*,, are the 
projections of the vector a*, upon the axes 1, 2, - - - m respectively of the 
system X; a*p;, a*go, -- + a@*g, are the projections of the vector a*, upon 


the axes 1, 2, - -- m respectively of the system Xx, = so forth. 

According to the definition of the systems % and &*, the axis 1 of % is 
orthogonal to the axes 2, 3,---” of Z*; the axis 2 of Zis orthogonal to the 
axes 1,3,--- of Z*, and so forth: Hence the scalar products of a, with 
the vectors a*,, a*3,---a*, are zero; the scalar products of the vector a2 
with the vector a*,, a*3, - - - a*» are zero, etc. 

At this point a peculiarity of affine geometry (usually souniseny to 
engineers) must be clearly understood. A distinction must be made be- 
tween the scale by means of which any length is measured and the scales 
which are attached to the oblique co-ordinate axes. The “scale of 
length ” is that which is attached to the rectangular Cartesian axes X 
(this is the same for all the axes 1, 2, --- ” in this system). The oblique 
axes carry their own scales but they are not used to measure length. 
The system 2 may in general have a different scale for each of its axes, 
and they are different from the scales which are used to ny, off the units 
on the axes of the system X*. The unit vectors a1, @2, --- dm are units 
aecording to the scales for the axes of the X system, and the unit vectors 
a*;, a*y, +--+ a*, are units according to the scales for the axes of the Z* 
system, but none of these unit vectors in general has unit length according to 
the “scale of length,” which is the scale for the units laid off on the axes 
of the system X. 

The reader should appreciate that a scale on a co-ordinate axis may, 
in general, have a twofold purpose: 

(a) It may be used to determine the value of the projection of a point 

upon that axis. 

(b) It may be used as a tape measure is used to determine the distance 

between two points in space. 

A scale may be used for either purpose alone, or for both. Use (a) is 
the one with which the reader is undoubtedly most familiar in connection 
with his analytic work. For example, when plotting a function y = f(x), 
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scales are employed for the x- and y-axes which seldom have the same 
size of unit, and are used merely to read values of x.and y, not also to 
measure length or distance. 

Interest in the measurement of length arises only when geometrical 
considerations enter into the problem. When they do, one must have a 
“ scale of length ’’ (tape measure) in addition to the various scales whose 
use is restricted to the determination of the values of projections. In the 
present problem the identical scales carried by the axes of the system X 
serve also for the measurement of length. This designation having been 
made, it is obvious that no other scales (except if they are identical with 
those of the X-system) may be used for the measurement of length. 

For curvilinear co-ordinates the situation is even more confusing, since 
the scales which the co-ordinate axes bear not only are in general different 
for the different axes of the same system, but also vary from point to point 
along these axes, whereas the scale of length is independent of the co- 
ordinate system and independent of the location within any system. As 
long as the oblique axes are linear, their scales are the same for all points 
within the system. 

The units for the scales carried by the axes of the oblique systems may 
have their lengths (as measured by the scale of the X. cayeten) so adjusted 
that the scalar produces of the unit vectors @1, d2, +--+ Gn respectively with 
the unit vectors a*,, a*,,---a*, are all unity. (This adjustment is called 
“ normalization.”) For Seance, if the angle between a, and a*, is a, 
the lengths of a, and a*, (that is, |a,| and |e*,|) are determined so that 
|a,| < |a*,| cos o4 equals unity. Since a1 is an arbitrary angle, it is clear 
that although the length of either a, or a*, can be chosen equal to unity, 
both unit vectors certainly cannot have unit length. _ 

As a result of this normalization process and the orthogonality between 
unlike numbered axes in the systems % and &*, the sets of unit vectors 
in these systems satisfy the following conditions: 


1 fori=k 
vo hl — 
a { 0 fori #k [31] 


in which the scalar product is indicated by a dot. In terms of the matrices 
29 and 30 these conditions are expressed by the matrix equation 


#8 Ok 
a, a*o1 ! a* nt 


G44 G2 + °° Gin a*12 A*a9 2 O*ng 1 0 0-:-90 
21 M2 don x)” soo ve 0 1 0---0 [32] 
Qni Ang ann 0 0 1 
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which is equivalent to 
@x @*, = U [33] 


Hence it follows that 
at, = @} or @* = @,;} [34] 


The matrix @* is, therefore, the reciprocal of @. The vector set a*,, 
a*,,++-+a*,, is called the reciprocal of the set a, dg, -- - @,, and the oblique 
co-ordinate system =Z* is referred to as the reciprocal of the oblique 
system =. The present result yields a useful geometrical interpretation 
for the elements of a square matrix and those of its reciprocal. Inciden- 
tally, it may be recalled in this connection that an orthogonal matrix is 
its own reciprocal, and this conclusion checks with the fact that an 
orthogonal co-ordinate system is also its own reciprocal. 

By use of one of the three equivalent relations for the vector x given by 
Eqs. 24, 25, and 26, the length of this vector may be expressed in terms 
of the co-ordinates of the rectangular system X, or of either of the oblique 
systems = or &*. The square of this length is evidently given by the 
scalar product of the vector. x with itself. Using Eq. 24, one has 


|2e|? = XH = (4% + Ugh + +++ + UnXn) 
+ (Myx + Uta + +++ + Unk) [35] 


Because of the mutual orthogonality of the unit vectors, the scalar 
product of any vector with any other in the set is zero, whereas this 
product of any vector with itself is unity. Hence in terms of the co- 
ordinates of the rectangular system X, 


Jacl? = ay? + ay? +--+ + ay? [36] 
which is the familiar Pythagorean proposition extended to -dimensional 
space. 

This result may be expressed in terms of the co-ordinates &, , +++ & 


of the vector x in the oblique system 2 by first determining the expres- 
sions for x1, %2, - + * X, in terms of the é’s, and then substituting these into 
Eq. 36. A general expression for the components 21, %2, - : - %» is evidently 
given by 


Xp = uy,-x for k=1,2,---n [37] 
which states that the projections of x on the axes 1, 2, -- - ~ of the system 
X are the scalar products of « with the unit vectors #1, we, - -- 4» of this 


system. 
Substituting Eq. 25 into Eq. 37 yields 


Lp = Uy + Ay by ~- Uy Agta +++ + + Uy Onkn [38] 


Art, 5). TRANSFORMATION TO OBLIQUE CO-ORDINATE SYSTEM 9 


But 
Uy 25 = it [39] 


because the scalar product of « with a; equals the projection of the 
vector a; upon axis k of the rectangular system X. Hence Eq. 38, written 
out for k = 1, 2,--+-m, reads 


Hy = Oy1&1 + doike +++ + Anién 
%q = Q12& + dagge +°++> + Anata [40] 


Xn = Anki + dene + +++ + Annbkn 


This set of equations may be written in matrix form by defining the 
column matrices 


% 
X2 
x)=} [41] 
Xn 
and 
fi 
£2 
es [42] 
En 
With the transpose of the matrix 29, Eqs. 40 evidently are expressed by 
«] = Gy é] [43] 


Substitution of this result into Eq. 36 is now greatly facilitated by 
noting that 


v1 
%2 


|x|? = ale X @] = [ay aes + an] X] | = a? tame? +--+ +a? [44] 


Xn 


Thus the square of the length of the vector x, expressed in terms of the 
co-ordinates of the oblique system =, becomes 


|x|? = £], xX @Q, x €] [45] 
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Here it is effective to let 


G=@x @, =| 82 822 *** Sa [46] 


CC 


8n1  8n2°°* Snn 
The elements of this matrix are evidently expressed by the scalar products 


Lik = O;* OE [47] 


from which it is clear that the matrix § is symmetrical. 
Equation 45 may now be written out, as follows: 


||? = grb? + giotito +++ + ginkiEn 
+ arkoét + Sooke? +-+++ gontotn 


a 


+ &nién’ + Bnaénte +++ + Sanén? [48] 


In the oblique system =, the squared length of the vector « is not simply 
equal to the sum of the squares of its components in that system, but the 
result expressed by Eq. 48 shows that all cross-product terms are present 
as well. Because of the symmetry of the § matrix, which is expressed by 


Sik = Bki [49] 


it is clear that all the cross-product terms in Eq. 48 occur twice. This 
fact makes possible some condensation of the form of Eq. 48. 

Tf all the scales for the axes of the Z system are chosen equal to the. 
scale of length (as can be and usually is done), the unit vectors a; have 
unit length. It then follows that all the elements on the principal diagonal 
of the matrix § become equal to unity. If the angle between the axes 
z and k in the & system is denoted by @,,, the result in Eq. 47 shows that, 
for this choice of scales, the elements of G are given by the simple formula 


Lik = COS Az [50] 


which yields unity for i = &. 

The determination of the matrix G is intimately connected with the 
choice of a scale in terms of which a length may be measured in the 
oblique co-ordinate system. Length in this system is defined by the matrix. 
&G. The latter is the matrix of the so-called fundamental metric tensor 
whose components are the elements g;x. 

The present discussion should be recognized as fundamental to tensor 
algebra also, even though we are not concerned here with that closely 
related subject. Matrix equations and tensor equations, in fact, differ 
only in notation. The geometrical interpretations of them, as far as this 
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discussion is concerned, are identical. Tensor notation is indeed some- 
what more advantageous than matrix notation for the operations now 
under consideration. The view taken in this book, however, is that 
matrix algebra is an invaluable aid to clear comprehension of the tensor 
method, and that the student will more readily assimilate that method 
once he has understood the essential ideas underlying the present dis- 
cussion. Proper grasp of the tensor method comes not from the mere © 
acquisition of a set of manipulative rules but rather from recognition that 
tensor algebra is a symbolic representation of geometrical and physical 
ideas. 

The square of the length of the vector x given by Eq. 36 may also be 
expressed in terms of the co-ordinates é* of the reciprocal oblique system 
=*. Thus, using Eq. 26, the co-ordinates of x in the rectangular system 

are, according to Eq. 37, given by 


we = Uy OE") + Uy Oaks ++ + Uy One n [S1] 
Here it is recognized that the scalar product 
u,-a*; = a* [S2] 


so that Eq. 51 written out for k = 1,2,+--m reads 
Hy = O18") + a¥ abo + + atid, 


Xo = a yb) + a¥ ante +--+ + a* not n 


Ce 


Xn = OM int) + a* ong ts +++ + Cant n [53] 
With the definition of the column matrix 
aan 
ar 
f] =| | [54] 
*n 


and the transpose of the reciprocal matrix 30, Eqs. 53 are given in matrix 
form by 


a] = @*,e*] [55] 
Utilizing again the relation expressed by Eq. 44, the square of the length 


- of the vector x in terms of the co-ordinates of the reciprocal oblique sys- 
tem reads 


| fa? = Lx OO, x FY [56] 
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Here it is effective to introduce the matrix 


* * 
811 S12 "°° Sin 


g* = Q* x @*¥, = Bor gta +++ Ban [57] 


ona one sous Ban 


- The elements of this matrix are given by the scalar products 


Brin = 0%; a, = 8 ni [58] 
In view of Eqs. 34 and 46, it is recognized that 
g* = a, >< @- = Gi x Gi = gi [59] 


The matrix §*, which is fundamental to the measurement oi length in 
the reciprocal oblique system, is the inverse of the matrix $. Hence 


Git 
fa = [60] 
in which G is the determinant of §, and G,, its cofactors. Alternatively, 
: G* 
Sik = Ge [61] 


in which G* is the determinant of $* and G*,, its cofactors. Evidently, 
by Eqs. 46 and 57, 


1 
~. Gt 
A representation for the elements g*,, similar in form to that given by 
Eq. 50 for giz, is, according to Eq. 58, 
gin = |a*,| x |a*,| cos OF i, [63] 


G= A? [62] 


in which |a*;| denotes the length of a unit vector in the reciprocal co- 
ordinate system 2*, and 6*;, is the angle between the axes z and & in this 
system. 

In general, the lengths* of the unit vectors in the oblique systems are 
given by the expressions 


Jas] = Vaan? + aig? ++ + Gin? = Ve: [64] 
and 
lat,| = Varn? + aa? to Fan? = VG [65] 


*The term “ magnitude ” would actually be more appropriate here than “ length ” because 
the unit vectors a; and a*;, like those for the rectangular coordinates, are assumed to be 
without dimensions. 
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When the former are chosen equal to unity, the latter are in general all 
different from unity; that is, each axis of the 2* system carries a different 
scale. 

The result given by Eq. 56 for the square of the length of the vector x 
in terms of the co-ordinates of the reciprocal oblique system, when 
written out, reads 

focl? = g¥ir¥1? + gh ioeie%o tee + Sink ie n 
+ gt oie*oe*) + g*ookte? + +++ + g* ant at n [66] 


+ S nik nt "1 + Snot nts +e + Sint a 


Again all cross-product terms (double products because g*iz, = g*xi) are 
present in addition to the square terms. 

Still another form for |x|? is possible if in Eq. 44 both of the alternative 
expressions 43 and 55 for x] are substituted, one for x], and the other 
for x]. Then the result reads 


|x|? = é]: x @Q*, x &] [67] 
or , 
|x|? = e], x @*@, x é] [68] 
But by Eq. 33 | 
@ x G*, = QF x GQ, = U [69] 
so that . 


|x|? = 2]. x] =e]. x €] 
= EEF) + bets +--+ + Enb*n [70] 


In this expression, no cross-product terms are present. It closely 
resembles the simple Pythagorean form given by Eq. 36, the only dif- 
ference being that products of corresponding components of the vector x 
in the two oblique systems appear in place of the squares of one kind of 
component. 

The & are called the contravariant components and the &*, the co- 
variant* components of the vector x. The contravariant components 
are the components of x in the oblique system 2, and the covariant com- 
ponents are those in the reciprocal system =*. Note in this connection 
that, for the two oblique co-ordinate systems, the property of being 


*These names are chosen with regard to the manner in which the components behave 
when subjected to a co-ordinate transformation. Thus the sets of variables £1:++ &, and 
t*, +++ £*, are said to be contragredient because when one set is subjected to a nonsingular 
linear transformation, the other is subjected to a linear transformation with the reciprocal 
matrix. For the moment, these matters need not be of interest, but they may be demon- 
strated from Eqs. 92 and 97 which follow. 
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reciprocal is strictly a mutual one. Hence the £*;,’s may just as well be 
looked upon as the contravariant components of %, in which case the 
&,’s become the covariant components. 

According to Eqs. 43 and 55, 


Gz] = Q* Ee] [71] 
Hence, by Eqs. 34 and 46, . 
GQ,é] = Se] = #] [72] 


It is thus seen that the covariant and contravariant components of the 
-- vector x are related by the matrix G. 
Another interesting pair of relationships is obtained through forming 
the scalar products a,-x and a*,-x, using for x the Eqs. 26 and 25 
respectively, and noting the conditions 31. This procedure gives 


aX = e*, [73] 


and 
a*, -x= & [74] 


With reference to Fig. 1, a,-% for k = 1 and 2 are the orthogonal 
projections OM and ON of the vector x upon the axes =, and Zp. 
Equation 73 states that the lengths of these projections, measured with 
the scales of the =; and =, axes, are numerically equal to the covariant 
components £*, and é*, of x, and may be substituted for these in Eq. 70 
in evaluating the length of the vector x. Similarly, o*, - for k = 1 and 2 
are the orthogonal projections OM* and ON* of the vector x upon the 
axes E*, and &*2. Equation 74 states that the lengths of these projections, 
measured with the scales of the 2*, and 2*, axes, are numerically equal 
to the contravariant components é, and £ of x. 

In view of the relation 73, the covariant components of the vector x 
are sometimes designated as the orthogonal projections of the point P in 
Fig. 1 upon the axes 1 and 2 of the system © (as contrasted with the 
parallel projections which are the contravariant components). Although 
this procedure is justified numerically, and does away with the necessity 
of considering the reciprocal axes (or drawing them in the case of a 
graphical determination), it fails to give the true geometrical picture 
regarding the nature of the covariant components. 

The scalar product of two vectors x and y in terms of their co-ordinates 
in an oblique system is readily determined with the aid of the above 
considerations. The components of y in the systems X, =, and =* may be 
denoted by yz, 7%, and 7*,. The corresponding column matrices which 
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represent these components are 


v1 m1 | a 
ye ne no 
y=]. n=]. Wy =] - [75] 
Yn tn In 
The matrix Equations 43 and 55 written for x and y 
x] = Gg] = G*e"T [76] 
. y] = An] = A**] [77] 
admit of four alternative representations for the scalar product 
y= ah x9] [78] 
They are 
ay = §],X @Q, Xn] =) x8 Xa] [79] 
SAL X a x] =F xe* xa] - [80] 
= §], x @Q*, X n*] = En*] . [81] 
= Be x Q*G, Xn] = &) a] [82] 


The first two of these results are similar in form to the Eqs. 48 and 66 
except that the terms are bilinear in £; and 7% (respectively £*; and n*,) 
instead of being quadratic in é; (respectively é*;). The last two are the 
mixed forms, which read 


xy = En*, + fone +++ + Enn*n [83] 
xy = Bim + ong + +> + Ente [84] 


The first of these involves the contravariant components of x and the 
covariant components of ; in the second form these two types of com- 
ponents are interchanged. 

Except for the appearance of the two kinds of components, the ex- 
pressions 83 and 84 parallel the customary one in rectangular co-ordinates, 
namely 

LY = Ky + Lee fees + Xan [85] 


In a rectangular (orthogonal) co-ordinate system, the covariant and 
contravariant components are identical, since the system is its own 
reciprocal. 


6. TRANSFORMATION FROM ONE OBLIQUE SYSTEM TO ANOTHER 


In addition to the oblique co-ordinate system characterized in terms 
of the rectangular system X by means of the matrix @, a second oblique 
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system may be considered as characterized by a matrix & with the set 
of unit vectors 


b, = ur + U2dn2 terest UnDin (k = 1, 2, spe n) [86] 
The reciprocal of this second oblique system has the set of unit vectors 
BY, = uyD* ey + Mob ng +--+ + Unb*in = (Rk = 1,2,-+-m) [87] 
which is the vector set of the reciprocal matrix %*, that is, 
1 fort =k 
oF, = 
Bea io for i # k [88] 


The contravariant and covariant components of the vector x in the 
second oblique system are denoted by {% and ¢*;, with the column 
matrices ¢] and ¢*]. Then the vector x, in terms of its components in the 
rectangular system X, has the following representations: 


x} = @é] = G*e*] 

= Bt] = B*t*] 

These relations readily yield the transformations from either of the 
variables ¢, or £*, to either of the variables ¢;, or ¢*, associated with the 


second oblique system and its reciprocal. 
For example, from the matrix equation 


[89] 


QE] = B* *] [90] 
premultiplication by 
B = (By [9] 
gives 
BA, x é] = &*] [92] 


which relates the contravariant components of x in the first oblique 
system (these are the &’s) to the covariant components of x in the second 
oblique system. The resultant transformation matrix in this case is 


C=8~xQ@, [93] 
If this is written out as 
C11 C12 Cin 
@= €ai4 Cog *°* Con [94] 
Cni ne Cnn 


with the vector 
Ce = U1CKy + Uelee + °° ° + Unlun (k = 1,2,--+m) [95] 
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then it is clear from Eq. 93 that the elements of € are given by the scalar 
products 


Ci = b; * a [96] 


Hence the elements of the resultant transformation matrix ‘Eq. 92 
are seen to be the orthogonal projections of the unit vectors a; upon the 
axes of the second oblique system (measured with the scales belonging to 
the axes of this system), or alternatively they may be considered as the 
projections of the unit vectors b, upon the axes of the first oblique - 
system, measured with the corresponding scales carried by these axes. 

The transformation 92, with its resultant matrix determined as just 
described, transforms the contravariant components of x in the first 
system into covariant components in the second oblique system. 

Because of the mutual character of the relation between an oblique 
co-ordinate system and its inverse, as pointed out in the previous article, 
Eq. 92 evidently remains true if & is replaced by B* and simultaneously 
¢ is replaced by ¢*. This shift gives 


B*Q: x é] = 3] [97] 


which is an equation relating the contravariant components of x in the 
first oblique system to the same kind of components in the second system. 
The elements of the resultant transformation matrix are determined and 
interpreted geometrically just as for the transformation 92 except that 
the reciprocal of the second oblique system with its unit vectors b*, 
replaces the unit vector b;, with its corresponding axes. 

Further detailed discussion of these transformations becomes decidedly 
awkward in terms of matrix notation, as the reader can at this point 
readily appreciate by continuing the formation of various additional 
obvious relationships. Indeed, it is this circumstance which is the Lest 
justification for the introduction of the notation and the conventions of 
tensor algebra. The most important point in this notation lies in the 
designation of components in the reciprocal co-ordinate systems (co- 
variant quantities) by lower indexes (subscripts) and those in the given 
co-ordinate systems (contravariant quantities) by upper indexes (super- 
scripts) instead of by the placing of an asterisk on these quantities. 
The circumspection which results from this simple artifice alone may 
readily be appreciated even by the reader who has as yet no familiarity 
with the tensor notation. 

However, a detailed discussion of these matters at this point would 
lead the reader too far afield from the present objectives, which are to 
lay a general foundation for a more thorough consideration of this subject 
at a later time. 
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7, SYSTEMS OF LINEAR ALGEBRAIC EQUATIONS. 


As a result of the geometrical interpretations given in the preceding 
articles the problem of finding solutions to a set of linear simultaneous 
algebraicyequations may be discussed with a greater degree of clarity 
than is possible from a purely algebraic point of view. To start with, the 
number of equations is considered equal to the number of unknowns. 
The equations read 


OX, + Areke +++ + inka = V1 
g1%1 + dgoX%2 + +++ + Genkn = Yo [98] 
Oni%1 + Anoka +++ + Anntn = Yn 
with the matrix 
ai 12 Gin 
Q= ag, G22 *** Gan [99] 
Ont On2 °°" Onn 


The components 4@:%, %r, and 4, of the vector set a; and the vectors x 
and y respectively fork = 1,2,--- mare the ordinary components with 
respect to a rectangular Cartesian co-ordinate system in an m-dimensional 
space. 

The vector set a; (for 7 = 1, 2,--- nm) and the vector y are specified, 
and the problem is to find the corresponding vector x. The problem “of 
particular interest according to the discussion in Art. 9, Ch. I, is the 
question whether a vector # can be found to satisfy the transformation 
98 when ¥ is specified to be zero. In other words: Can the transform of a 


vector x be zero? 
In this case Eqs. 98 become homogeneous. In vector form they read 


a@,:% =0 [100] 


According to these equations, the vector x (if one exists) must be simul- 
taneously orthogonal to all the vectors a1, 42, °° * a in the vecior set of @. 
The possibility of the existence of such a vector evidently depends upon 
the rank of this vector set. If the rank is , then, according to the dis- 
cussion in Art. 2 above, the vector set utilizes all the available ” dimen- 
sions, and it is clear that no vector x can be found to satisfy the conditions 
100. In this case the determinant of @ is not zero. 
On the other hand, if the rank of the vector set of @ is (n — 1), one of 
the available » dimensions is left unoccupied. For example, if in three- 
dimensional space the vectors 41, @e, and ag lie in a plane, only two dimen- 


. 
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sions are utilized by this vector set, and any vector x normal to this plane 
constitutes a solution to the homogeneous Eqs. 100. The vanishing of the 
determinant of @ is seen to be the necessary and sufficient condition for 
the existence of such a solution. - 

At the same time it is also clear that the solution merely specifies a 
direction for the vector x; its length is arbitrary. This checks with the 
conclusion reached in Art. 9, Ch. I, on purely algebraic grounds, according 
to which only the ratios of the unknowns xj, %2, - +: X, to each other are 
determined from the homogeneous equations. The direction cosines of 
the vector x are given by the expression 


2 
et +o fae? = = tne 01] 
Xe Xk Xk 
for k = 1, 2, --+ m, and this expression involves only the ratios of the 


unknowns. 

If the rank of the vector set of @ is (n — 2), two of the available dimen- 
sions are left unoccupied. For example, if in three-dimensional space the 
vectors 41, dg, and ag lie in a straight line, any vector x lying in the plane 
normal to this line is simultaneously orthogonal to all three vectors in the: 
set, and hence any such vector constitutes a solution. Although an 
infinite number of x-vectors may be drawn in this plane, only two inde- 
pendent ones can be found in this way. Hence when the rank of the matrix 
@ is (n — 2), two independent solutions exist to the set of homogeneous 
Eqs. 100. 

In general, then, if the rank of @ is (w — p), the homogeneous equations 
have p independent solutions. In any case, however, the length of the 
vector x for any of the solutions remains arbitrary; and if the rank of 
@ is r < n, the equations determine only r of the x;’s in terms of the 
remaining ones. Choosing arbitrary sets of values for these remaining 
%j,’8 permits any number of solutions to be obtained, of which no more 
than » can be linearly independent. The x-vectors corresponding to ~ 
independent solutions are in general not mutually orthogonal to each 
other, but a set of » mutually orthogonal solutions may be found by 
proceeding in the following manner. 

If the vector set a1, d2,--- dq has the rank r, it is surely possible to 
find r linearly independent vectors in this set. The numbering in the 


original set may be changed if necessary so that the vectors a1, dg, - ++ a, 
are the linearly independent ones. Then the r equations 

Oy1%y + Ay9% +--+ + Gin%n = 0 

1% + AgoXg + +++ + dontn = 0 [102] 


i  ) 


GpyXy + ApgXg + +++ + Arn%n = 0 
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uniquely determine 7 of the x,’s in terms of the remaining ones. That is, 
a set of values 


YX, = 1b (for k= 1, 2, oe ie n) [103] 


of which p = n — rare arbitrarily chosen (not more than » — 1 of these 
may be chosen equal to zero, however) may be considered to be the 
components of the first x-vector constituting a solution. This is the vector 
% = a, with the components a4. | 


Now the vector set a,--- @, is augmented by having the vector a; 
associated with it. This augmented set is surely an independent one, 
because a, is orthogonal to all the vectors a -: + Gy. Hence the r+ 1 
equations 

Qy1%1 + A1%2 + +++ + nt, = 0 
wg hes ects tehsil lave bY) gre cent ie 
Ory + yo%g + +++ + rnin = 0 [104] 
1X1 + are%2 + +++ + ant, = 0 


uniquely determine r+1 of the x’s in terms of the remaining ones. 
That is, a set of values 


a, = om, (fork =1,2,++-”) | [105] 


of which p — 1 are arbitrarily chosen (not more than p — 2 of these may 
be chosen equal to zero) represents the components of a vector x consti- 
tuting a second solution which is orthogonal to the augmented vector set 
a, +++ @,, a, and hence orthogonal to the first solution « = a. This 
second solution is the vector « = a2 with the components ag,. 

A third vector « = a3 with the components ag, is found in like manner 
by solving the set of r + 2 homogeneous equations corresponding to the 
linearly independent vector set a] «+ - Gr, a1, 2. These solutions are the 
components 


LE = Age (for k = 1,2,--+m) [106] 


of which p — 2 are arbitrarily chosen. 

A continuation of this process finally yields a set of mutually orthogonal 
vectors a1, a2, °** &p, all of which are orthogonal to all the vectors in the 
set a1, @, +--+ dy. Such a set of mutually orthogonal solutions is, however, 
not unique, because of the arbitrary choice of p, p — 1, p — 2, etc., 
x,-values at the various steps in the process of solution as described above. 

The case for which the rank of the vector set of @ is r = 1 — 1 is most 
frequent in its occurrence in practice. Here only one independent solution 
(direction for the x-vector) exists. Because of the relation expressed by 
Eq. 46 of Ch. I and because of the fact that A = 0 forr = ” — 1, the 
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following relationship holds in this case (for all values of 7 and & from 
1 to n) 
. GiyAg + @igAne + +++ + dinArn = 0 [107] 


in which A;, are the cofactors of the elements a;, formed from the de- 
terminant A of the matrix @. 

If A; is defined as a vector having the components Axe, the relation 
107 in vector form reads 


a;- A, =0 [108] 


which states that the vector A; is simultaneously orthogonal to all the 
vectors in the set a, do, -+- @,. This vector, therefore, constitutes the 
desired solution. 

Note that the cofactors Az, for s = 1,--+m are formed for the ele- 
ments of the kth row of @. Any row may be chosen. If, however, all the 
cofactors of the elements of that row happen to be zero (as is possible 
because the rank is x — 1), the solution assumes an indeterminate form. 
However, since the rank is assumed for this case to be not less than x — 1, 
a row can surely be found for which not all the cofactors of the elements 
are zero. 

The direction cosines of the vector x constituting the desired solution 
are given by 


A ks . 
V Any? + Ang™ +++: + Ann? 


in which @, is the angle between the direction of the vector x and the axis 
s of the rectangular co-ordinate system. 

Another problem of special interest in connection with Eqs. 98 occurs 
when the rank of the vector set of @ is less than , but the y,-values on 
the right-hand sides of these equations do not happen to be zero. That is, 
the equations are inhomogeneous but the determinant of their matrix is 
zero. In Art. 9, Ch. I, it is pointed out that solutions may still exist if the 
y;'s have values satisfying Eq. 56, which reads 


Yi = Ay, + agdig + +++ + andin @=1,2,---m) [110] 


but that the solutions assume an indeterminate form when expressed 
according to Cramer’s rule. 

The a,’s in Eq. 110 are arbitrary factors. In vector foe this equation 
reads 


cos 6, = [109] 


y= aya,! + Aig," + Pee -+- Cn dn? [411] 


and states that the vector y is given by a linear combination of the 
transposed vector set of @. By making use of the relations 110, Eqs. 98. 
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may be rewritten in the form 


41%") + ayex’g +--+ + Oni" =0 

oyX"1 + Aag%’g +--+ + dant’n = 0 [112] 

Oni X'y + Angt’s + +++ + Gnnt'n = 0 
in which 

x, = Xp Or [113] 

Or if a1, a2, +++ @»_ are considered to be the components of a vector a, 
and x’, %’2,--+ ’, the components of a vector x’, 

vw =x-a and «=%7 +4 [114] 


The vector x’ is a solution to the homogeneous equations 112. The 
problem of finding solutions x’ when the rank of @ is r < m is discussed 
above, and the desired values of the vector x are given in terms of those 
for x’ by Eq. 114. 

When the vector y, in this type of problem (for which r < m), cannot 
be expressed in the form of Eq. 111 (in this case y is not a member* of 
the transposed vector set of @), the original set, Eqs. 98, is said to be 
inconsistent, and no solutions are possible. 

It is interesting that the discussion of this special problem incidentally 
offers also an alternative interpretation to the solution of the general 
inhomogeneous equations 98. This case is contained in the solutions to 
the homogeneous equations 112 for x’ when the rank of @ is m. Then the 
application of Cramer’s rule yields x’ = 0; that is, x’) = a’, =--+a', = 
O, and the desired solution for x follows from Eq. 114, namely, for r = n, 

x= a [115] 
In this case, Eq. 111 yields the following representation for the vector y: 
y = X01" + Xedq' + +++ + Sndn’ [116] 


which is an interesting alternative form for the linear nonsingular trans- 
formation. 

It appears from this discussion that the necessary and sufficient con- 
dition for the existence of solutions to the inhomogeneous equations 98 
is that the vector y be a member of the transposed vector set a1‘, ae’, 
-++a,', regardless of the rank of @. If the rank is , then :y must neces- 
sarily be a member of the transposed vector set (no matter what the 
values of ¥1---n may be) because the rank of the augmented set 
a;', dg, «++ ap‘, y cannot be greater than m (see footnote below). Then 


*If the rank of the transposed vector set and of that set augmented by the vector y are 
alike, then y is called a member of the transposed vector set, and hence is expressible in the 
form given by Eq. 111, 
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there exists a unique solution for x which is given by Cramer’s rule. 
When the rank of @ is less than , no solutions exist unless is a member 
of the transposed vector set of @, in which case m — r linearly independent 
solutions can be found. 

A summary statement regarding the possibility of finding solutions to 
Eqs. 98, known as the rule of alternatives, may now be formulated, as 
follows: 


_Ejither the matrix @ of Eqs. 98 has the rank 1, whence A = 0, and 


there exists one set of values x1, %2, -- + %, which satisfies the equa- 
tions for arbitrary values of 1, yo,-- + ¥n.—%in particular the trivial 
values 4, = % =---=%X,=0 resulting for y= y=--- = 


> Yn = O0— these solutions being given by Cramer’s. rule, 


or the matrix @-has the rank 7 = » — p (p # 0), whence A = 0, and 

the corresponding homogeneous equations have » independent 

solutions. The inhomogeneous equations then have solutions for 

particular values of +1, ye, - - - Yn only, namely those values for which 

' the vector y is expressible as a linear combination of the transposed 

“vector set of @. To these particular solutions may be added those for 
the corresponding homogeneous system. 


The more general case of m equations with n unknowns 


@y1%1 + Ay2%2 + +++ + Ginkn = V1 
GoiX1 + AeaXe + +++ + donkn = Yo [117] 
Omit, + AmeXe +++ + OmnXn = Vn 
may now readily be disposed of. Multiplying the equations respectively 
by the unit vectors 1, #2, -- > %, associated with the m axes of a rec- 
tangular co-ordinate system, and adding, yields the result 
y = X01! + Xodq! +--+ + Sndy! [118] 
where 
Y= U1 + Ue + +++ + Um¥m [119] 
and 
Gy! = Uy din + Udon +++ * + mde [120] 


are the vectors in the transposed set of the matrix @ of Eqs. 117. 

The result given by Eq. 118 states that for any set of values x1, x2, 
-++%, which simultaneously satisfies the m equations 117, the vector y 
is a member of the transposed vector set of @. In other words, unless y 
is a member of this vector set no solutions are possible, regardless of the 
rank of the set. This condition, which is arrived at above for the special 
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case m = n, is thus seen also to be the condition for the existence of solu- 
tions in the more general case m # n. 

When m ¥ n this condition is more significant, for if the rank is x 
and m =n, the condition 118 is automatically fulfilled; whereas when 
m > n, for example, and the rank is , y may or may not be a member 
of the transposed vector set. In other words, when m = n and the rank 
is 2, there is no question about the existence of solutions to the inhomo- 
geneous equations; but when m # n and the rank is n(<m) or m(<n), 
the existence of solutions still depends upon the fulfillment of the con- 
dition 118. 

If this condition is written 


Y= a0" + agg’ + +++ + ndn’ [121] 
and a1, a2, +++ a, are regarded as the components of a vector a, then by 
letting 

ev =x—-a’ x=2’+a [122] 


as in the problem for m = n, the question of solutions to Eqs. 117 may 
be discussed for the inhomogeneous and the homogeneous cases simul- 
taneously, because the resulting equations 


4124 + ayox'g +++++ dint’, =0 
ci See e ie = 0 [123] 


AmiX"1 + Ame%’g + +++ + OmnX'n = 0 


apply to the inhomogeneous equations 117 for a ¥ 0 and to the corre- 
sponding homogeneous equations for a = 0. 

As discussed in Art. 2 of this chapter, the rank of the nonsquare matrix 
@ of this set of equations is at most equal to the smaller of the two 
numbers m and n. If m > n, then r S n. Out of the m equations 123 it is 
possible to select r independent ones (the coefficients of these define 7 
independent vectors in the vector set of @). For these equations, m — r 
independent solutions x’ may be found by the procedure discussed for the 
case m = n and r S n. In particular, for r = n only the trivial solution 
x’ = 0 exists, which yields « = a. The solutions so found automatically 
satisfy the remaining m — r equations in the set 123 because the coeffi- 
cients of these equations are the components of those vectors in the vector 
set of @ which are linear combinations of the others. 

When m < n, then r S m. Equations 123 then yield m — r independent 
solutions which again are found as for the case m = ” and r S m. In 
particular, for r = m there are n — m independent solutions. 
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8. ON THE RANK OF MATRICES HAVING A NULL PRODUCT 


If @ and & are two square matrices of order n, their product, 
C=@a@xB [124] 


is also a square matrix of order » with coefficients given by the scalar 
products 


Cy = A: b,? [125] 


in which @;, @g, + - + dy is the vector set of @ and b,¢, by', - - - b,,“ is the trans- 
posed vector set of 8. 

All the vectors a; can be orthogonal to the vectors b,’, in which case 
all the coefficients c,, are zero. It thus becomes clear that the matrix 
product @ X & may be zero even though neither @ nor & is zero. This is 
an important difference between the laws of matrix algebra and those of 
scalar algebra, according to which the relation ab = 0 requires that 
either a or 6 be zero. 

The vector interpretation of Eq. 125 allows the following conclusions 
to be drawn when C = 0: ; 


(a) If @ is of rank n, then B must have the rank zero; that is, in this 


case B = 0. 
(b) If neither @ nor & is zero, then the rank of both must be less than 
n. 


The first of these conclusions follows from the consideration that if the 
vector set a; has the rank m (occupies all the available dimensions) no 
vector exists which is simultaneously orthogonal to all the vectors a;; 
that is, 6,’ = 0. In order that any vectors b;’ may exist, the vector set a; 
must evidently have a rank less than ”, and conversely the rank of the 
vector set 5,’ must be less than x in order that any nonzero a; vectors 
may fulfill the condition a; - b,’ = 0. 
If the null condition 


a;° by! =0 [126] 
be compared with the system of homogeneous equations 
a;-% =0 [127] 


the discussion of the previous article yields the conclusion that if the rank 
of the matrix @ is rg S nm, then m — r, independent vectors b,¢ exist which 
fulfill the null condition, and hence the rank 7 of the matrix 8 may be as 
large as but no larger than ” — rq; that is, 7» < m — 7g. Thus if 


@xB=0 [128] 
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then 
Ta tt Sn [129] 


in which @ and & are square matrices of order m with the ranks 7, and 
ry respectively. 

It should be observed that the converse of this statement is not neces- 
sarily true, nor does it follow that the reversed product B@ is zero when 
QQ is zero. 

An interesting application of this result is found in considering the 
product of a matrix @ and its adjoint @*. From the definition of the 
adjoint matrix (see Art. 6, Ch. II) and determinant theory (see spe- 
cifically Eqs. 45 and 46 of Ch. I), it follows that 


A 0 0-0 
axa@=axaa}O 4 0 0 t=au [130] 
0 0 A 


in which A is the determinant of @. Hence if the rank of @ is (m — 1), 
A = 0, and according to the relations 128 and 129 the rank of @* can 
be no greater than 1. Since the elements of @* are the cofactors of @, and 
at least one of these is not zero because the rank of @ is (mn — 1), the 
rank of @* cannot be less than 1. Hence the summary: 


If @ has the rank m, then @? has the rank n. 
If @ has the rank » — 1, then @? has the rank 1. 
If @ has any rank less than  — 1, then the rank of @* is zero. 


In connection with considerations of this sort it is useful to observe 
that if a given square matrix @ has the rank 1, it possesses no more than 
one independent row and one independent column of elements, and hence 

it may be represented as the product of a column and a row matrix, thus: 


@=|  |xX[hi b2--: Bal > [131] 


Similarly if a given square matrix has the rank 2, it possesses no more 
than two independent rows and two independent columns, and may 
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evidently be represented as 
11 12 
@= : ; Aone | 
; ; e Ki +++ Ban [132] 
Ani Ane 


The generalization of these statements is obvious. 


9. SYLVESTER’S LAW OF NULLITY 


From the preceding considerations it follows that if a square matrix @ 
of order is given, and if this matrix has the rank 7, and the nullity ~,, 
‘a nonsingular matrix % can be found such that the product 


. C=@xB [133] 
has ~, null columns. For example, if , of the columns of & are the 


components 61%; box, - + - bax Of the vectors 5,’ (k = 1, 2, -++ pa), which 
are independent solutions to the homogeneous equations 


ay: b;! = 0 

t- 
eee [134] 
Qn: by! = : 


pa of the columns in €C are composed of zeros. The remaining r, columns 
in & may be chosen arbitrarily as long as they are independent, that is, 
as long as & turns out to be nonsingular. 

There is no nonsingular matrix & for which C has more than #, null 
columns, because (since % is nonsingular) C has the same rank as @ (see 
Art. 9, Ch. II) and hence must have 7, independent columns. If, how- 
ever, pp of the remaining columns in & are then chosen to be linear 
combinations of the above #, columns, € has p, + #» null columns, and 
® is singular with nullity ~,. Moreover any matrix & with nullity pp 
_ which produces . + » null columns in C must have the form of the 
particular one chosen here, for 2 + f» of its columns must satisfy Eq. 
134, and » — #, of its columns must be independent. In addition, it is 
not possible for any matrix & of rank 7, to produce more than pa + pp 
null columns in €, because any such new column requires that another 
column in & become a solution to Eq. 134, and thereby be a linear 
combination of ~, of the independent columns already in &. The rank of 
& would then have to be less than 75. The nullity of C is therefore at least 
ba + py. But it cannot be greater than this because, if it were, post- 
multiplication of Eq. 133 by a nonsingular matrix S could produce 
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another null column in C. This would demand that BF be a matrix of 
rank r, producing more than ~, + , null columns in C, which has been 
shown to be impossible. Hence no matrix & of nullity p, can give Ca 
nullity greater than pa + fz», that is, 


be S ba t+ bo [135] 


Of course pa + » may exceed n, in which case evidently p, cannot be 


greater than 1. 
Now let it be supposed that two nonsingular transformation matrices 


J 4 and Jz are determined such that 
@’=T,a [136] 


is a matrix whose first 7. rows are independent and the remaining ones 
null, and 

B’ = BF [137] 
is a matrix whose first 7» columns are independent and the remaining 
ones null. Then 


TJ <@BTy = Q’B’ = TsCTz = C’ [138] 
has the same rank as C, but is evidently in the form 
(iy °° Cir, 0 eee 0) 
lrat °° * Cran Q0--- 0 
eg ve 0 [139] 
0 0 


It then becomes clear that the rank of @’ and hence that of € cannot 
exceed 7, OF rp, whichever is the smaller. Or the nullity of C must be at 
least as great as Pq Or P», whichever is the larger. 

Together with Eq. 135 this conclusion yields the summary 


Pr S Pa + bo [140] 
Pa > Po 
cz 
‘ oo 
or stated in terms of rank, 
Te Z ta tt — [141] 


Ta <% 
< 
fs { TT < 1% \ 
This useful statement regarding the rank or nullity of a product matrix 
in terms of the ranks or nullities of its components is known as Sylvester’s 


law of nullity (or degeneracy). 
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10. REDUCTION OF A SQUARE MATRIX TO THE DIAGONAL FORM OF 
ITS LATENT ROOTS; NORMAL CO-ORDINATES 


In view of the vector interpretation of the linear transformation 
Qi + + Ankn = N 
snp map bhaoiel Sige nceate aGeesn [142] 
OnyXy +++ + Onntn = Vn 
as discussed in Art. 3 of this chapter, it is interesting to inquire whether 
there exists a vector x whose transform ¥ has the same direction in space, 
that is, whether a solution to Eqs. 142 exists which conforms to the 
stipulation 
y= de [143] 


or more specifically 
: ye = dx; = (fori = 1,2,+++n) [144] 
in which 2 is a nonzero™ factor. 

Substituting the values given by Eq. 144 for the right-hand members 
of Eqs. 142, and transposing these to the left-hand sides, yields the condi- 
tions 

(a1 ~ d)x4 + 1 2X2 pes A1nXn = 
oi%1 + (deg — A)te + +++ + Gondn = 


Oni X1 + An2%2 Bee @ (@nn = A)Xn = 0 
This set of homogeneous equations has nontrivial solutions only when the 
determinant of its matrix (called the characteristic matrix of @) is zero, 
that is, when 


oo 


[145] 


(a1, ~ 0) a2 cies Ain 
dat (age _ ) ea5 don <a 0 , [146] 
Qn Ono hae (nn = d) 


This determinant is called the characteristic function of the matrix @. 
It is evidently a polynomial in \ of the mth degree. The condition equation 
146, which is called the characteristic equation, is an algebraic equation of 
the nth degree in 4, and hence there are values of for which the 
matrix of Eqs. 145 has at most the rank (# — 1). These so-called latent 
roots or characteristic values may be denoted by Aj, Az, +++ An: 

In the general case for which the matrix @ of the transformation 142 is 
unsymmetrical, the latent roots may be real or complex. Also they may 


*The stipulation ) = 0 leads to the inquiry regarding the existence of solutions to the 
corresponding homogeneous equations, which has been discussed previously. : 
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be distinct, or there may be coincident roots. If all the latent roots are 
distinct, the rank of the characteristic determinant for any particular 
latent root \, cannot be less than (n — 1); for if it were, all its (n — 1)- 
rowed minors would. be zero, and hence its derivative with respect to » 
would also be zero when \ = 2,. This fact would indicate that \, must be a 
multiple root of Eq. 146, which contradicts the assumption that the 
latent roots are distinct. 

When the roots are distinct, one may conclude that there are m distinct 
directions for the vector x which remain unchanged by the transformation 
142, that is, there are » directions for which the vector y coincides with 
the vector x. These are given by the vectors %, %, - - » which are found 
to be solutions to the 1 sets of homogeneous eons 145 resulting from 
substituting successively the characteristic values \i, Ag, --~ An for A. 

If the cofactors of the elements of the determinant in Eqs. 145 for the 


root \ = X, are denoted by &,, the direction cosines of the vector % are, 
according to Eq. 109, given by 
& 


k; 
Lig = aa ESAT 
(ki)? + (Ria)? + -++ + (Rin)? 


in which the index 7 is arbitrary but must, of course, be the same for the 
determination of all the direction cosines corresponding to a given ° 
root Ag. 
The components 1, 22, : * : %n of the vector may be taken equal to the 
s s 


(fork =1,2,---m) [147] 


cofactors kj, hia, +++ Rin respectively (7 arbitrary) or to any multiple of 
these values since the length of the vector is not determined by Eqs. 145. 


Unit length for the vector x results when its components 2%, %2, °** Xn 
are taken equal to the direction cosines ?1s, fos, -: > £ns respectively. 
With the aid of the column matrices 

8 

a1 f ls 

%2 fos 

ieee ies ae Ea ee [148] 
Bn ts 


the transformation 142 for the condition 144, appropriate to the root 
> = Ag, may be expressed by the matrix equations 


Qx] = rx] [149] 
or 
@?,] = rele] [150] 
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There’ are » matrix equations of this form for the roots corresponding 
to the integer values 1, 2, - - -. for the indexes. 

These matrix equations may be combined into a single one by de- 
fining the matrix 


fir Paya t Ptn 
) cas oe 

MS a ok acatee (151) 
Pai Pre eae fan 


in which the first column is the column matrix 148 for s = 1, the second 
column is the column matrix 148 for s = 2, and so forth. The set of 1 


matrix equations 150, for s = 1, 2,--- is then given by 
Alia Aafia + * Anfin 
ae a Arta A2lea ee Anfen [152]. 


ArPat Aelna tee Anban 


or, by the introduction of the diagonal matrix 


1 0 O---90 
pe Oy [153] 
0 O An | 
this is 
Q2=La [154] 


It may now be shown that when the )-roots are distinct the matrix & 
given by Eq. 151 is nonsingular or that the m vectors 4, %, --- % whose 
directions remain unchanged by the transformation 142 form a linearly 
independent set. The proof is readily given by assuming that this vector 
set is linearly dependent and showing that such an assumption leads to an 
absurdity.* 

The assumption that the vector set %, %, -- - % is linearly dependent is 
equivalent to the statement that the vector relation 


rik + yo + +++ + rat = 0 [155] 


can be satisfied for y-values other than all zeros. This vector relation is - 
alternatively expressed by 


. End] =0 [156] 


*The proof given here was suggested by Professor D. J. Struik of the Department of Math- 
ematics at the Massachusetts Institute of Technology. 
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Multiplying Eq. 149 by y, and summing over s gives 
E Grd] = @E rd] = Bar] [157] 
and in view of Eq. 156 this yields 
= rad] = 0 [158] 


Repeating the same process but with Eq. 149 multiplied by A, in addition 
to ys, and this time using Eq. 158 instead of Eq. 156, gives 


3 


Yehs2%] = 0 [159] 
1 


s= 


In other words the assumption expressed by Eq. 155 or Eq. 156 together 
with the matrix Eq. 149 leads successively to relations of the form 


yedrs"x] = 0 [160] 
t 


& 
I 


or in vector form 
vyimh + yoda +++ +b yada = 0 [161] 


in which the exponent & can be 0, 1, 2,---. For & = 0, Eq. 161 is the same 
as Eq. 155. 

Since the A-roots are assumed to be distinct, this result means that 
any number of independent relations of linear dependence among the 
vectors 4, %, --- % exist—clearly an absurdity. For example, in three- 
dimensional space, the existence of one relation of linear dependence 
means that the vectors lie in a plane; the existence of two independent 
relations of-linear dependence means that the vectors lie in a straight 
line; the existence of three independent relations of linear dependence 
requires that the vectors be zero. 

If two of the \-roots are coincident, it might appear that two of the 
vectors x become equal (or at most proportional), since they then repre- 
sent two solutions to the same set of homogeneous equations 146. Under 
such conditions the modal matrix £, given by Eq. 151, would become 
singular by reason of the two proportional columns within it. It must be 
recognized, however, that the solution 147 for the various vectors x = ?, 
is based upon the fact that the rank of the characteristic determinant is 
exactly m — 1, and although it is surely true when the d-roots are dis- 
tinct, it may or may not be valid in the case of repeated roots. In fact it is 
shown in Art. 4, Ch. IV, that, if the matrix @ is symmetric, the occurrence 
of a repeated d-root of order p S requires the rank of the characteristic 
determinant for that particular value of \ to be m — p. It is then possible 
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by the methods discussed in Art. 7 of the present chapter to find p 
independent solutions to this single set of homogeneous equations, and 
to make these solutions orthogonal and of unit length if desired. Under 
such conditions, the present proof of the independence of the solutions 
for all the \-roots may be carried through without significant alteration, 
bearing in mind that the p solutions corresponding to the p equal d-roots 
are already chosen to be independent of one another. 

Unfortunately in the general case, when the matrix @ is not symmetric, 
there is no unique correspondence between the order of a repeated \-root 
and the rank of the characteristic determinant. The nullity of the char- 
acteristic determinant may be less (but not greater) than the order of the 
repeated root, in which case solutions for 2 independent vectors + cannot 
be found. For example, the unsymmetrical matrix 


110 
011 
00 1 


has a repeated latent root \ = 1 of order 3, whereas the nullity of the 
corresponding characteristic determinant is only 1. Since the rank of the 
latter determinant is 3 — 1 = 2, it is possible to find only one independent 
vector x, and therefore any attempt to form a modal matrix yields a 
singular result. Hence the existence of a nonsingular modal matrix £& is 
guaranteed when and only when the matrix @ is either symmetric, or 
has # distinct latent roots. 
In this case Eq. 154 may be premultiplied by £~ to give 


LAL =A [162] 
The matrix @ is thus reduced to the diagonal form of its latent roots. 


The significance of this reduction, as far as the transformation 142 is 
concerned is recognized when the latter is written in matrix form, 


Ga] = y] [163] 
and the changes of variable expressed by 
a]=2x'] and yj] =£y'] [164] 
are considered. Then, in view of Eq. 162, the transformation 163 becomes 
| Aa”) = y'] [165] 
which represents the equations 
Me = 4 
Aet’s = y's [166] 


Ce ey 
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It thus becomes clear that the change of variable expressed by Eq. 164, 
which amounts to transforming from the co-ordinates x and y to new 
co-ordinates x’ and y’, reduces the set of simultaneous equations 142 to a 
set of distinct equations, the solutions of which are immediately set 
down. Another way of looking at this situation is to say that in the new 
co-ordinate system to which x’ and y’ refer, the variables x’, --- x’, are 
no longer dependent upon each other as are the variables 4 --+%, in 
the original co-ordinate system to which x and y refer. 

In connection with physical problems, the new co-ordinates in which 
the equilibrium of the system appears to be expressed in terms of separate 
equations are called the normal co-ordinates of the system. The trans- 
formation to normal co-ordinates, which is effected by the matrix &, 
apparently isolates the various modes of behavior of the physical system, 
and for this reason the matrix £& is spoken of as the modal matrix. 

In view of the fact. that the vector x, which constitutes a solution to 
Eqs. 146, is determined in direction only, it may be noted that the matrix 
Q is still reduced to the diagonal form of its latent roots if in the relation 
162 the modal matrix & has its columns multiplied by arbitrary nonzero 
factors. In other words, the matrix & in Eq. 162 may be replaced by the 
product £9 in which D is an arbitrary nonsingular diagonal matrix. 
Since the inverse of 2D is DL, and DAD = A, the truth of this 
statement is evident. 

The type of transformation 162 to which the matrix @ is subjected 
evidently corresponds to identical transformations for the variables, as 
shown by Eqs. 164. Thus a transformation of the matrix @ of the form 


Cac = B [167] 
corresponds to a transformation of the variables as expressed by 
x] =Cx’} and y] = Cy’) [168] 


in which C is necessarily nonsingular. 

The geometrical significance of these relations can be studied through, 
first of all, recognizing that the transformation 142 may be looked upon 
as determining a point , -- - yn lying on a plane through the origin of 
co-ordinates defined by the equation 


Bi + Bove + +++ + Bn¥n = 0 | [169] 


corresponding to a given point x - - - x, on a similar plane defined by the 
equation 


aX, + AgXe + oe. + AnXn = 0 [170] 


It may be helpful in this connection to consider the coefficients a - ++ an 
to be the components of a vector a. Then Eq. 170 states that a- x = Q, 
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which requires that the vector x be orthogonal to the vector a, and hence 
that the point x, - - - x, lie in a plane normal to the direction of a. Equa- 
tion 169 may similarly be interpreted, and it follows from the solution 
of simultaneous equations that the components of the corresponding 
vector @ are given by the relation 


be =a*,-a (fork =1,2,---n) — [171] 


in which a*, ---a*, is the vector set of the matrix @* reciprocal to @. 
According to Eq. 74, the coefficients 6, are thus seen to be the contra- 
variant components of a with respect to the oblique co-ordinate system 
defined by the vector set of @. 

If the matrix @ is nonsingular, the transformation 142 yields a one-to- 
one correspondence between points on the two planes defined by Eqs. 
169 and 170. One may say that Eqs. 142 transform planes into planes 
and hence also straight lines into straight lines.* For this reason a linear 
transformation is also spoken of as a collineation. 

If the co-ordinates x1 ---%, and 4, --~- yn define the points P and Q 
respectively, the collineation 142 relates any chosen points P,, Po,--- 
(not necessarily on the same straight ljne) to corresponding points 
Q1, Qe,-++. The foregoing discussion in this article shows that a non- 
singular collineation possesses ” distinct points P,--- Pn (called the 
fixed points) to which the corresponding points Q, ---Q, are so related 
that the directions OP; and OQ, (O denotes the origin of the co-ordinates) 
are coincident, and OQ;/OP;, = x. This so-called descriptive or projective 
property is evidently placed in evidence by the collineation which has 
the diagonal matrix A. The collineations with the matrices @ and A have 
the same projective properties, and are therefore spoken of as being 
equivalent. 

More generally, any two collineations with the matrices @ and & related 
by Eq. 167 have the same projective properties. Equation 167, which 
expresses the equivalence of the two collineations, is called a collineatory 
transformation of the matrix @. 

The characteristic matrices of @ and & are related by the same trans- 
formation. With the aid of the unit matrix %U of like order, the character- 
istic matrices of @ and & are expressed by 


(@ — »U) and (B— AU) [172] 


*The truth of the latter part @f this statement may be seen from the fact that the trans- 
formation 142 relates points on a given continuous curve to points on another. continuous 
curve. If a plane 1 is transformed into a plane >’, and a second plane 2 (not parallel 
to #1) into the plane t's, points on the linear intersection of 1 and #2 must correspond to 

- points on the linear intersection of p’1 and #’2; for these points are common to the planes 
pi and p2 and, because of the continuity of the transformation, they must also be common 
to the planes ~’; and ’». 
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But by Eq. 167 
CA — aAU)E = C7AC — aC*UC = B— r~U [173] 


The characteristic function of & (determinant of the characteristic 
matrix) is therefore 


[B — ru] = |C*(@ — au)e| . 
= C@ — rulc = |@ — »U| [174] 


in which the bars enclosing the matrix expressions indicate that the 
determinant of the enclosed matrix is meant, and C is the determinant 
of the matrix C. 

The important conclusion to be drawn from this last relation is that 
the two matrices @ and &, which are connected by the transformation 
167, have the same characteristic equation and hence have the same 
latent roots. In other words, the latent roots of a matrix are invariant to a 
collineatory transformation. 

In this connection it is useful to note that Eqs. 162 and 167 yield 


L1CBOVL =A [175] 
from which it is clear that the modal matrix for & is given by the product 
CoS. 

11. THe CayLEey-HAMILTON THEOREM 


‘An interesting relationship follows from Eq. 162, namely 


Q@x@=@j=Lil fal = 2xe [176] 
in which 
1? 0 0 0 
SA | Oe OD [177] 
0 = 60 An? 


This relationship may readily be generalized with the result that the mth 
power of a nonsingular square matrix @ is given by 


Qn = Lange [178] 
with | 
m™ 0 0O--- 0 
am {| 2 %™ O--- 0 179] 


CY 
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If P(A) is a polynomial in \, that is, 
P(d) = pd? + GpydP?7! 4 +++ + ad + a [180] 
the result expressed by Eq. 178 shows that* 
P(Q) = a,@? + ap_s@?! +--+ + aA + ap VU 


= £P(A)e- [181] 
According to Eq. 179, however, 
Pi.) O O-+ 0 
POS ge aces [182] 
0 0 P(dn) 
and hence a polynomial in terms of a nonsingular matrix @ becomes 
P(1) 0 oO -:-- 0 
P@) =e x] 0° 200) OO xe [183] 
0 0 P(n) 


This result becomes particularly interesting when the polynomial P(A) 
is chosen to be the characteristic function of the matrix @, that is, 


P(r) = |@ — »U| [184] 
Then P(\) = 0 is the characteristic equation and hence P(\,) = 
P(\2) = +--+ = P(\,) = 0. The right-hand side of Eq. 183 is then zero, 


‘and the left-hand side is the characteristic function of @ with the matrix 


@ itself substituted in place of the variable \. The conclusion follows that 
the matrix @ satisfies its own characteristic equation. 

The above derivation requires that the modal matrix £ be nonsingular, 
which means that the matrix @ is either symmetric or has distinct 
latent roots. In spite of these restrictions on the method of derivation 
employed here, it can be shown that the last result, which is known as 
the Cayley-Hamilion theorem, is true in general for any square matrix.} 

A numerical illustration is given by the following: 


O 2 84), 3 
@e=l-1- 2 =1 [185] 
1 1 4 


*Tt should be observed that the constant term in the polynomial P(@) is ap@°, and that 
the zero power of a matrix is equal to the unit matrix of like order. 

See, for example, L. E. Dickson, Modern Algebraic Theories (New York: Benj. H. Sanborn 
and Co., 1926), p. 48. 
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The characteristic equation is given by 


—r —1 —3 
—1 (2-2) -1 =0 
1 1 (4 — r) 
which yields 
3 — 6? + 111-6 =0 [186] 
or in factored form 
(’— 1) — 2) — 3) =0 [187] 
Hence the Cayley-Hamilton theorem in this case states that 
@? — 6@? + 118 —- 6U =0 [188] 


or in factored form that 
(@ — U)(@ — 2U)(@ — 3U) = O [189] : 


Sieg the matrix @ from Eq. 185 into the last equation yields the 
identity 


= ae oe —2 -1 -3} [-3 -1 ~—3 
—1. 1 -1{x]{-1 0 -1]x}-1 -1 -1]=0 [190] 
I 4. <3 1) 1034.2 a 


which the reader may readily verify for himself. 

It is worth noting from Eq. 188 that higher powers of the matrix @ are 
expressible in terms of powers up to and including the (m — 1)th. Thus, 
for example, 


@= 6@?—11@ + 6U 
@Q*= 6@?—11@24+ 6@ =25@?— 60@ + 36% [191] 
@ = 25@? — 60@? + 36@ = 90Q2 — 2392 + 150% ~ 


and so forth. : 
In a similar manner, negative powers of @ may be calculated. Thus 
from Eq. 188 in the above example 


6G" = @? — 6@ + 11U 
627 =@ —6U + 11@* = 1414? — 10@ + §5% [192] 


and so forth. In particular, the inverse of @ by this method is found to be 


9 1 7 
@1=1@7-A+ Wy = | 3 3 | [193] 
—-3 -1 —-1 
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12. SYMMETRICAL TRANSFORMATION 


Returning to the discussion of Art. 10, note that when the matrix of 
the transformation 142 is symmetrical, the vectors * whose directions are 
left unchanged by the transformation form a mutually orthogonal set. 

The truth of this statement may be seen by considering Eq. 149 for 
two particular latent roots 4, and d,, thus 


Qi] = rH] [194] 
GE] = 44] [195] 
The transpose of the matrices in Eq. 194, postmultiplied by %] reads 
; 4],2 x] = eX] i] [196] 
and Eq. 195 premultiplied by the transpose of ] is 
. 4] ,Q%] = re] i] [197] 


Now if @ is symmetrical, it is equal to its transpose @,, so that Eqs. 196 
and 197 then yield 


et] E] = et] 4] [198] 
or 
. (As — Ax )¥]a4] =O - [199] 
Written out, this reads 
By 
(rex) [te + + + Sn] ; = (Ng — x) rE + hake t+ +++ +3ndn) = 0 
; [200] 
Ln 


which is alternatively expressed by the scalar product 
(A, — 4): # = 0 [201] 


Since this result states that the scalar product of two vectors corre- 
sponding to two different latent roots is zero, the vectors must be normal 
_ to each other. In the case of repeated -roots, it is pointed out in Art. 10 
of this chapter that the corresponding % and % can always be chosen to be 
orthogonal. Thus the apparent failure of the conclusion drawn from 
Eq. 201 in this instance is not significant. 
In view of Eq. 201 it is also easy to show that the latent roots of a 
symmetrical matrix must be real. For if they were complex they would 
have to appear in pairs of conjugates,* for example, 


Nw=atjes and y=a-—Jp [202] 


*Since the matrix is assumed to have real coefficients. 
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and the vectors % and % satisfying Eqs. 194 and 195 would likewise 
have to be conjugate complex, thus 


x=u+jv and #=u—jo [203] 
Their scalar product is given by 
e-E=u-ut+ov-v+jiu-v—v-u) ~ [204] 
which is a real nonzero value. Hence Eq. 201 would demand that A, = dz 
or 
a +768 = a — 78 [205] 
whence 
B=0 | [206] 
In other words, the latent roots must be real. 
When the vectors « form a mutually orthogonal set, the modal matrix 
given by Eq. 151 becomes orthogonal, because its elements are then the 


direction cosines of a set of mutually orthogonal axes. Hence when the 
matrix @ is symmetrical, its modal matrix has the property expressed by 


; &, = x [207] 
and the transformation of @ to the diagonal form of its latent roots reads 
&GL=A [208] 


In this case, the normal co-ordinates of the physical system whose 
equilibrium is expressed by Eqs. 142 are orthogonal to each other. 

Since the matrices encountered in practical problems are predomi- 
nantly symmetrical, the results of the present article are quite significant. 
These matters are elaborated upon in Ch. IV. 


PROBLEMS 
1. Determine the rank of each of the following vector sets: 


5, 3, ~—1) t 3, 4, ~1) fe 2, 4) 
(a) ; (10,0, 2) = (b) 4(—1.5,2, -0.5) (ce) 4 (1,3, 9) 


( 0, 1, 0) ¢ 6, 8, —2) (4, 4, 16) 
(ay {¢ 34, 60) @,1,1;1) (1, 1, 1) 
\(— 12a, 188) 1 2, 2,2, 2) @) 40,684, germ) 
(3:3, 3,3) (1, e979), e444 /3)) 
(4, 4, 4, 4) 


d, 2?, 23, apie 2n-1) 
(1, 33, 38) ++ 34) 
(g) 5 U1, 43, 4%, --> 49-4) 


(1, m?, m8, +++ nt) 
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2. A given vector set @1, a2, +> @m in an 7-dimensional space (with m < 1) is 
linearly dependent as expressed by 


101 + A202 ees 4mm = 0 


Show that all determinants of order m, formed from this vector set, vanish identically. 
What can you say about the rank of the vector set if a// the a’s have finite nonzero 
values? If all but one of the a’s are zero? 
Which of the following vector sets is linearly dependent? What is the rank of each? 


( 6a, —2b, —4c, 2) (6, 7, 9, 11, 13) 
( a, b, : ¢, d) (2, 4, 6, 8, 10) 
(-3e, 6, 2c, —1) 


3. Consider the vector set a4, @2, > > * @m in an m-dimensional space with m = n + 1, 
and suppose that the first 7 vectors a1, - + * d, are linearly independent. Show that one 
may write 


Om = O1y04 + Oj902 +°°* + O1ndn’ 


and express the coefficients a1, in terms of the determinant of the vector set a), - - - dn, 
its cofactors and the components of the mth vector. 

Generalize this result for the case m =n + p with p > 1, and illustrate with the fol- 
lowing numerical example: 


(—1, —2, —3) 
( 2, 4, 5) 
(1, 1, 1) 
( 1, 3, 5) 
(-1, 0, 1) 


4, Consider three vectors in three-dimensional space. The vector a, has a length of 
eight units and is directed at an angle of 45° relative to the co-ordinate axis 3, while 
its projection upon the 1,2-plane makes angles of 45° with both axes 1 and 2. The 
vector @ lies in the 1,2-plane, having a length of six units, and oriented at angles of 
30° and 60° respectively relative to the axes 1 and 2. The vector a3 has a length of 
four units and coincides with axis 3. Write down the matrix corresponding to this 
vector set. Express each of these vectors linearly in terms of the three unit vectors 
coinciding with the co-ordinate axes. 

5. The vector set a, @2, *** dp in an m-dimensional space has the rank m = n — 1. 
Show that 


n 
DY anAns.=90 fori =1,2,-++” 


and interpret the. significance of these equations geometrically. Suppose that this 
vector set is so chosen that the components a;, have.finite nonzero values. A set of 2 
vectors 61, bs, -+ + b» in an m-dimensional space is now defined with the components 
biz = d%/din. Obtain for this vector set the relations 


abi + edie +++ > +Ombin = 1 fori =1,2,---n 


giving the appropriate expressions for the coefficients a,, and show that all these 
vectors terminate in the same plane. What is the equation of this plane? 
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Illustrate this problem for 7 = 4 with the vector set 


( 1, 1,0, 1) 
( 2, 3,1, 1) 
( 4, 0,2, 1) 
(-2, —2, 0, —2) 


6, Consider again the vector set of the previous problem and show that 


n 
X onAin =0 fork = 1,2,---n 


A i= 
Assuming that the vectors are so numbered that Ann = 0, deduce the result 
Gy = O40, + ede +°°* + On-1dn-1 


giving the values of the coefficients a,. Determine this relation of linear dependence 
for the numerical vector sets: 


( 1, 2, 1, 0) (4, 2, 5, 1) 
( 3, 0, 4,1) (3,0,4, 1) 

34, 0, 0,1) G, 0,0, -4 
(-2, —4, —2, 0) (0, 3, 1, 1) 


7. Suppose the vector set a1---+ a, of a matrix @ in an n-dimensional space has 
the rank r = 2 — p (1 < p < 2), and assume the numbering of the vectors and com- 
- ponents to be so chosen that the rth order minor in the upper left-hand corner of the 
determinant A is nonzero. Obtain linear relations between each of the p dependent 
vectors and r independent ones by the following procedure. 

From the original matrix, form submatrices by striking out all but the first r 
columns and all but one of the last p rows. Call this row'the (y + i)th. There are p such 
submatrices as 7 goes from i = 1 to i = p, and each submatrix then consists of r 
columns and r + 1 rows. A group of 7 +1 minor determinants of order + is now 
obtained from each’ submatrix by deleting therefrom the kth row, with k = 4, 2, 
+++ +1. Each of the minor determinants so generated is prefixed by a sign-con- 
trolling factor (—1)‘+!*® to yield a “signed minor,” denoted by ai. Note that 


Ort = Ari = 6'* = Ap ry ¥ O. 
Show next that the following systems of relations hold 
aes G@ =1,2,-++p) 


Oy =0 
2, wore Gat oH) 


or that one may write the desired set of relations of linear dependence among the 
vectors: 

C1101 + O12d2 + +++ FOr rpidr41 = 0 

C2101 + Agee + +++ + O2.r410r42 = 0 


Opid1 + Apede + +++ + Ap rp10rp = 0 
Illustrate with the numerical vector set of rank 2: 
(2, 1, 6, 8) 
(1, 1, 3, 5) 
(0, 4, 0, 8) 
(0, 1, 0, 2) 
8. aes two vector sets @1°++ a, and b;-++d, with the square matrices @ 
and £8. 
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(a) If the vectors in the two sets are connected by the relations 
n 
Ze, Cina =b; fori =1,2,---n 
k= 


in which the matrix C is nonsingular, show that the components are connected by the 
matrix equation : 


Cx@=8 
and that the components of the corresponding transposed vector sets are related by 
a, x C, = B: 


(b) Regard a1-- +a, as a given vector set and b1--+- 5, as obtained from a1,--- ay 
through linear transformation with the nonsingular matrix C. Show that the rank of 
the vector set is invariant to this transformation but that such would not be the case 
if C were singular, 

9, Two vector sets a,:-+ a, and b,°-+5, with the square nonsingular matrices 
@ and & have the matrix products 


QB=af and GA =2 
(a) If the set a1---@, is transformed by a nonsingular matrix @ as indicated by 
ae-@ | 


how must the set 5,--- b, be transformed in order that P = P? 


(b) Derive the corresponding relation between 2 and 2, and show that the 
determinants of these matrices are equal. 

10. Let Pi and Pe be two points in an #-dimensional orthogonal Cartesian co- 
ordinate system. The vectors x; and xg with components 711°: + Xin and %91° ++ Xen 
respectively emanate from the origin of co-ordinates and terminate upon the points 
Py and P: 2. 

Express the direction cosines cos a1, COS @e, - + * cosa, of a vector drawn from P2 
to P in terms of the components of 1 and x2 ta the distance / between P; and Po, 
and prove the identity 


— cos? a, + cos? ag +: ++ + cos? a, = 1 
11. With respect to an oblique co-ordinate system, the contravariant components 
of a vector x are denoted by £1, £2, +--+ &, and its covariant components by £*1, £2, 


-. &*,. The corresponding components of another vector y are m1, 2, °++ 7, and 
n*1, n*2, °° + 1*n. If the contravariant components of the two vectors are related by the 


matrix equation 
[+++ &] x@ =[m--: ml 


and the covariant components by a similar equation with the matrix &, how is B 
related to @? 


12. In an oblique co-ordinate system two vectors x1; and x2 have components which 
are conveniently expressed as the elements of the following row matrices: 


[Eu E12 s+ Ein] Et Exe + ++ Een] (contravariant) 
[11 tie +++ Fan] (Ee. EYen «+> Fan] (covariant) 


The lengths of these vectors are I; ee la, @ is the angle between them, and d is the 
-distance between their tips. 
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(a) Prove that: 
d? = [1% — Eau] [Ete — Prone [1] 
and 
cos @ = [Aix} X [A*ox]e = Prox] X D*12): [2] 


in which Nok = En/Ls and N* 5% = &,1,/ls. 

(b) Prove the invariance of these expressions under any linear transformation of 
co-ordinates. 

(c) Show that Eq. 1 is equivalent to 


d? = 1,2 + 1,2 — Qyle cos 6 


(d) For any vector « with contravariant and covariant components &, and £*,, 
show that 


[Ax] X [\*e]e = 1 


Observe that \; = &,/2 or \*, = &;// cannot be represented as the cosine of a real 
angle but that —1 5 A,A*, S 1. (The dy, are called direction parameters and the 
A*;, the moments of the vector.) 

(e) In a given n-dimensional co-ordinate system let A,, and A*,, be the parameters 
and moments of a set of vectors x, whose components are regarded as the elements of 
tow matrices. If a new co-ordinate system is introduced through a transformation with 
the nonsingular matrix [cy], the parameters and moments are carried over into the 
- values Xe4 and *,,. Show that the following expressions apply: 


[Nex] = [Ase] & [cz] = and = [A* ex] = [A*sc] X [ev]? 


13. The rows in the following matrix represent components of a given vector set in 
an orthogonal Cartesian co-ordinate system: 


4 —5 1 
—2 5-1 
3 0 1 


(a) Compute the magnitudes of these vectors, their direction cosines, and the cosine 
of the angle between each pair. 

(b) Find the inverse matrix and, for the inverse vector set which it represents, 
compute the corresponding quantities to those in part (a). 

(c) Make a 120° isometric plot showing all vectors. 


14. Make computations as in parts (a) and (b) of Prob. 13, considering the matrix 


1 4 1 3 
0 -1 3 -1 
3 1 0 2 
1.-2 5 1 


Repeat the computations, using the transposed matrix. 

15. (a) Consider the vector set of Prob. 13 to define the directions of an oblique 
co-ordinate system. With reference to the original orthogonal Cartesian system, the 
new vector [1, 3, —1] is given. For this vector compute the contravariant and covari- 
ant components, as well as its direction parameters and moments (see Prob. 12). 

(b) Make an isometric plot showing the position of the vector and its above- 
mentioned components. 

(c) Repeat part (a) with reference to the vector set of Prob. 14. 
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16. Let the direction parameters \1, and do for & = 1,2,-++ 2 be regarded as the 
contravariant components of two vectors 71 and v2 (sometimes called “ versors ”). 
If 9 is the angle between these versors, show that 


n 
sn? 9= DO ,, BiiBre — gin@ie)AuArjAonALe 


task, t= 
n 
> ae) Ca gutdasdar ) - p= grid) ($ >a _ ik) 
4j=1 = ik 


in which gj, are the components of the fundamental metric tensor. 


17. The matrix of the fundamental metric tensor in a given space of four dimensions 
is 


el Se 
m WD pe 
a od 


Compute: 

(a) The angles between the oblique axes. 

(b) The corresponding metric coefficients (reciprocals of the lengths of the unit 
vectors a1 °° + a4 of the oblique system). 

(c) The length of a vector whose contravariant components are (5, 3, 2,1), and 
its direction parameters. 

(d) Compute the matrix $*. 

(e) The angles between the oblique axes defined by $*. 

(f) The corresponding metric coefficients. 

(g) The covariant components and moments of the vector given in part (c). 


18. Let § be the fundamental tensor corresponding to a system of oblique axes 
and let £] be a column matrix whose elements are the contravariant components of a 
vector. Because of a co-ordinate transformation with the nonsingular matrix C, the 
contravariant components of the given vector become the elements in the column 

matrix f= C x €]. Denoting by G the matrix of the fundamental tensor with 
respect to the new co-ordinate axes, what are the expressions for § and §* in terms 
of G and G*? 

Taking for G the matrix in Prob. 17, and (5, 3, 2, 1) for the contravariant com- 
ponents of the given vector, compute the corresponding components of the vector 
with respect to the new coordinates if the transformation matrix is 


1 4121 3 
0 -1 3 -1 
C=|3 10 2 
1-25 1 


Compute the angles between the new co-ordinate axes. 

19. Consider an oblique co-ordinate system and a point P such that all the contra- 
variant components of a vector drawn from the origin to P are positive. The portions 
of the oblique axes coinciding with these vector components are regarded as the 
coterminous edges of a parallelepiped. 

(a) Show that the volume of this parallelepiped is given by the formula 


V = VG Exkoks 
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in which G is the determinant of the matrix G and £1, £2, £; are the contravariant 
components of the vector. 

(b) Write down the expressions for the areas of the faces of this parallelepiped 
which lie in the 1,2-plane, the 2,3-plane, and in the 3,1-plane respectively. 

(c) Recognizing that the extension of the above formula to more dimensions reads 


V = VG Eiko ++ En 


compute the volume of the hyper-parallelepiped in four-dimensional space defined by 
the matrix § of Prob. 17 and a vector with the contravariant components (1, 2, 3, 1]. 

Compute the area of the parallelogram for each pair of co-ordinates. 

20. Let x1, %2,°-+%, be the contravariant co-ordinates of a point P in 2-dimen- 
sional space with respect to a set of oblique axes. This system is regarded as immersed 
in an m-dimensional space with m > n. In the m-dimensional space the co-ordinates 
of the same point P with respect to an orthogonal set of axes are yi, y2,°** Ym. The 
orthogonal and the oblique axes have coincident origins. - 

Show that the y;’s are expressible in terms of the x;’s in the form 


n 
Ze, tiie = 98 (@ = 1,2,-+-m) 


but that not more than x of the ,’s are independent. More specifically if the first n of 
the above equations are independent show that yni1, Ynt2,°** Ym May be expressed 
in terms of y1‘- + yn, by means of the relations 

1 n n 


> LD aA jy; = Vi G@=n+1,n+2,-++m) 
A j=1 R=1 


in which A is the determinant corresponding to the first 1 rows of the matrix [@mn] and 
Aj are the appropriate cofactors. aE 
Show further that the squared line element ds? is expressed by 


dt = ¥ (dy)? = EY SM Mae, ae 
ee ee fh aay Oe, 


n 
= x ujdXE dx; 
k,j=1 


in which 
= F IM: Oy: 


is a £=10x% Ox; 


are the components of the fundamental metric tensor in the #-dimensional space. 

Compute the values of gz; in terms of the a,, of the transformation. 

21. Let x] and a] be column matrices and regard their elements x1-+- x, and 
@1*** @, as components of a variable vector x and a fixed vector a with respect to an 
orthogonal co-ordinate system. ; 

(a) Show that the equation 


a]x] =d —_ (ascalar) 


represents the equation of a plane, that the vector @ is normal to this plane, and that 
its distance from the origin is / = d/(magnitude of a). 

(b) In transforming to a new system of oblique co-ordinate axes, show that the 
distance of / and the point defined by a@ will not change if the vector x is subjected to. 
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the transformation 

Q*x] = £] 
whereas the vector a transforms as 

Ga] = a*] 


22. Let £] and a] be column matrices whose elements are the contravariant com- 
ponents of a variable vector and a fixed vector respectively with reference to an 
oblique set of axes in three-dimensional space. Show that the plane defined by 


a];xé]=d (ascalar) 
cuts the co-ordinate axes at points whose distances to the origin are given by 


d 
sp = Veu— 
i 8ii a; 
and that the cosines of the angles that the plane makes with the co-ordinate axes are 


iV 8ii 


cos 6; = “Tel 


in which [a] denotes the length of the vector a. 
23. Let the equations of two planes be given by 
a@]:xXé]=d, and £])t Xt] =de 
Show that they intersect at an angle @ for which 


cos? = ala x § X B] 


and compute this angle for the planes corresponding to vectors a and 8 with com- 
ponents 3, —1, 4 and 2, 8, 6 with 


3°21 
G=)2 41 
1 1 2 


24. Suppose that a space of three dimensions is immersed in a space of four dimen- 
sions. Let y be a vector whose components when referred to a set of orthogonal axes 
in the four-dimensional space are yi, ye, v3, y4. The point determined by this vector 
lies also in the three-dimensional space and is there characterized by the co-ordinate 
X1, Xe, X3. 

Denoting three constant vectors in the four-dimensional space by a, 8, y, show that 
the equation 


y1 92 Ys 4 
a1 a2 03 O41 =d (a scalar) 


represents a plane in the three-dimensional space and find the equation of this plane 
in terms of the x-co-ordinates, recognizing that one may write 


3 
Le cure =y1 for i = 1,2,3,4 
k=1 
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25. With respect to an orthogonal set of axes in three-dimensional space, three 
points P, P2, P3 determined by the linearly independent vectors a, 8, ) lie in a plane. 

(a) Write the expression determining a variable vector « which terminates in this 
plane. . 

(b) Determine a unit vector perpendicular to the plane. _ 

(c) If the points Pi, Pe, Ps are transformed into P1, Pe, Ps; through a nonsingular 
transformation with the matrix @ such that the vectors become 


a] = Qa] 
8] = 2.6] 
7) od Gry] 


show that the transformed vector *] = @,«] terminates in the new plane. 
(d) Let the components of a, 8, y be given by (3, 2, 1), (4, 1, 5), (1, 1, 1), and find 
the transformed plane corresponding to 


1 2 3 
@Q=]1 2 4 
13 1 


26. Let P be a variable point on the surface of a hypersphere of radius p and with 
its center at Po. P and Pp» are determined respectively by the variable vector x and 
fixed vector w. The co-ordinate system is orthogonal and the equation of the hyper- 
sphere is given by 


{an — axle X [x4 — ax] = p? 


in which «1° -- x, and a,:*-a@, are regarded as elements of column matrices. 

If the points P and Pp are transformed into P and Py through a nonsingular trans-_ 
formation with the matrix @, compute the distance between the two new points P 
and Py and show that the new point P does not in general lie upon a new hypersphere. 

Illustrate for » = 2 with a, = a2 = 1 and for the matrix 


1 1 
@ = [3 2 
find the locus of the point P as the point P traverses its circle. 

27. (a) If a-matrix & is obtained through a collineatory transformation of the 
matrix (@, show that the matrices B* and B-! are obtained through the same collin- 
eatory transformation of the matrices @" and @-1. 

(b) If matrices B and & are obtained through the same collineatory transformation 
of matrices @ and QD respectively, show that the matrices (B + &), BE, 6B, 
(BS), and (6 B)— are obtained through the same collineatory transformation of 
(@ + D), AD, DA, @D)“1, and (DA)! respectively. 

©) If & is obtained through a collineatory transformation of @, how is &; related 
to 


#P 
(d) Make a collineatory transformation of the matrix 


3°21 -1 14 1 
@=|]-4 1 1! withthematrixC =| 1 #0 —-2 
-1 -2 5 1-2 1 


(e) Let x and @ respectively be a variable and a fixed vector, and assume that 
al; X x] = 0. If y] = C-'Q Cx], in which @ and C are nonsingular, determine the 
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vector 8 satisfying the relation 8]-y] = 0 and illustrate for n = 3 witha], = [3, 2, 1] 
and matrices @ and @ as given in part (d). 


28. Find the latent roots of the following matrices 


1 -13 24 12 4 191 257 16 4 3 
0 -11i 21 13 9 114 ~152 8 bi 2 
0 -6 12 1 4 16 8 -115 8 


29. Find the latent roots of the matrix 


21 1 
—2 1 3 
3 1 -1 


Form the modal matrix © and check numerically the relation 2-1@& = A. Make an 
isometric plot showing the directions of the principal axes. 


30. Prove that the determinant and the rank of a matrix remain invariant under a 
collineatory transformation. 


31. Show that the roots of the characteristic function are invariant to elementary 
transformations of the characteristic matrix. Through an appropriate succession of 
elementary transformations of the characteristic matrix, show that the latent roots 
of the matrix 


5 0 -1l 0 
05 0 -1 
9 1 5 0 
09 oO 5 


are Ay = Ag = 5 — 73 and Ag = Ag = 5 4-73. 


CHAPTERIV. 


Quadratic Forms 


1. THE QUADRATIC FORM ASSOCIATED WITH A LINEAR TRANSFORMA- 
TION : 


If in the linear transformation 


Qy1% + Ay2%2 + +++ + Ainkn = V1 
Gq1%1 + Ge2%q + +++ + Gentn = Yo [1] 
OniX1 + AngXg + +++ + AnnXn = Va 

the equations are multiplied respectively by x1, x2, - > - Xn, the sum of the 


resulting left-hand members is a rational entire function which is homo- 
geneous and quadratic in the variables x, --- %,. This function is called a 
quadratic form. More specifically it is referred to as the quadratic form 
associated with the linear transformation 1. 

Written out, this function has the appearance: 


Fe= 04104? + Gygxgxy +--+ + QinXnXt 
H+ doy%1%q + dge%e? + +++ + donXnXe [2] 


ee 


+ dniX1%n + AngXet%n +++ + Anndn? 


The terms on the principal diagonal of this square array involve the 
squares of the variables x, ---,; the remaining terms involve all the 
possible cross-products. 

The matrix @ of the transformation 1 is called the matrix of the 
quadratic form, and its determinant A is referred to as the discriminant 
of the quadratic form associated with this transformation. This chapter 
is concerned with real quadratic forms, that is, such forms which have 
matrices with real elements. 

The writing of this function can be abbreviated through the use of a 
-double summation, thus 

F= , x : AgpXiXy [3] 
Here the terms for i = 1 and & from 1 to ” are those in the first row of 
Eq. 2; the terms for 7 = 2 and k from 1 to ” are those in the second row of 
Eq. 2, and so forth. 

Since the terms with a,, and @,; for like values of z and & (k ¥ 7) involve 
the product of the same pair of variables x; and x; (for example, the terms 

132 


Art. 1} QUADRATIC FORM OF A LINEAR TRANSFORMATION | 133 


@12%%2 and d21%2%1), the quadratic form is evidently no less general if the 
matrix @ of the associated linear transformation is assumed to be sym- 
metrical; that is, if 


Qik = Ake or (Cm > a : [4] 


Hence in the discussion of quadratic forms it is possible to assume that 
the associated linear transformation is a symmetrical one, and thus to 
make available to the discussion the special properties of such transforma- 
tions pointed out in the last article of the preceding chapter. 

A quadratic form may also be thought of as obtained through the 
_ Squaring of a linear form; thus 


F = (aX, + agtg + +++ + andy)? [5] 


although the result is less general because it contains only m arbitrary 
coefficients.* Its formal appearance is, however, the same, as may be 
seen if all the terms in the full square represented by Eq. 5 are system- 
atically written down; thus 


P= ayy? + ayargtyxg + +++ + ayanXiXn 
+ axgar%o%, + ag2%o? +--+ + aganXe%n [6] 
HA ano %n% + AnagtnXe +--+ + ee og 


Identification of Eggs. 2 and 6 leads to the relation 
Qi, = Aa, [7] 


The function F as given by Eq. 5 is more convenient for the derivation 
of the following useful relations. Thus it is readily seen that the partial 
derivative of F with respect to one of the variables x; is given by 

oF 
an = 2a; (a,x, + agx%e +--+ + an%n) [8] 
% 


By substitution of Eq. 7, the corresponding result for the function F of 
Eq. 2 readst 


: 
2 


This result allows the quadratic form to be expressed in terms of its par- 


oF 
— = Ay % + Aynke + °° + + Ainkn [9] 
OX; 


' *The rank of the quadratic form (which is the rank of its matrix) is in this case 1, as may 
be seen from the fact that the coefficients given by Eq. 7 are formed in the same fashion as 
are those of the matrix in Eq. 131, Ch. ITI. For the present discussion the rank of the quad- 
ratic form is immaterial. 

{This result may, of course, also be obtained directly from Eq. 2, utilizing the symmetry 

condition 4. 
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tial derivatives thus, 


1(aF aF oF 
Was eee ee eres 10 
en a ot OX2 we + Bs OXn . | ] 
or in the more compact notation as 
12 oF 
=p Deane es 11 
. 2 x Ox; = [ ] 


Correspondingly, the linear transformation 1 associated with a quad- 
‘ ratic form F may, according to Eq. 9, be written in the form 


1 oF 
~~ HM fora = 1,2,--+-n 12 
Fae , 2, +++) [12] 
2. GEOMETRICAL INTERPRETATION OF A QUADRATIC FORM; THE 

QUADRIC SURFACE ASSOCIATED WITH A LINEAR TRANSFORMATION 


According to the derivation of the quadratic form from its associated 
linear transformation 1, it is clear that the function F may be identified 
with the following bilinear form in the variables x, --- 2, and y--+- Yn: 


F = xyy1 + *aye +--+ + 2nn [13] 


If x1.-- +p, and y, - - - ¥_ are regarded as the components of the vectors x 
and y respectively, the quadratic form is represented by the scalar 
product 


F=n-y [14] 


That is to say, the value of the quadratic form may be expressed as 
the scalar product of the vector x with its transform y. Of interest in 
connection with this interpretation is inquiry into the significance of the 
equation 

F=1 [15] 


or in other words into the question: For what values of the variables 
21 -+* 2% does the quadratic form maintain a constant value? In Eq. 15, 
the constant value is arbitrarily set equal to unity. The question may, in 
view of Eq. 14, be stated in another way: For what lengths of the vector x, 
which, it is assumed, may have all possible orientations, does the scalar 
product of the vector with its transform y have the fixed value unity? 
As the vector x assumes all possible orientations, under the stipulation 
that its scalar product with the vector y maintains a unit value, its length 
_ is evidently forced to vary in a very definite manner. The tip of the vector 
x, therefore, describes a surface (in n-dimensional space this surface is 
n — 1 dimensional), and the solution to the inquiry stated above is seen 


Ari. 3] TRANSFORMATION OF VARIABLES 135 


to resolve itself into the determination of the surface defined by Eq. 15, 
in which F is a function of the variables x, - - - Xn. 

Since the function F is quadratic in the variables x, - - - x,, the surface 
in question is evidently one of second order, which is also known as a 
guadric surface. In three dimensions the quadric surfaces are those of the 
familiar ellipsoid, the paraboloid, or the hyperboloid. The quadric sur- 
faces in n-dimensional space may be visualized in a similar manner, 
although their reality is, of course, lost. 

Since the function F contains no linear terms, its value is unchanged if 
the algebraic signs of all the variables are reversed. This fact means that 
the vector x satisfying Eq. 15 has the same length when its direction is 
reversed. Hence the quadric surface is symmetrical about the origin. 
A surface of this type is referred to as a central quadric surface (alterna- 
tively as a central conic). 

The central quadric surface has hyperbolic or ellipsoidal characteristics, 
or both, depending upon the values and the algebraic signs of the co- 
efficients a, in the form F. This question is discussed further in a subse- 
quent article. 


3. TRANSFORMATION OF VARIABLES 


When the variables x; - - - x, in the quadratic form given by Eq. 3 are 
subjected to a linear transformation such as 


x] = Ce’] [16] 


in which x] and x’] are the column matrices 


Ly rol 
Xo x5 

a] =| _ and oo’) =] [17] 
Ln : Xn 


and C is an arbitrary square matrix of order ”, then F becomes a quadratic 
form in the new variables x’, - - - x’,. When the matrix C is nonsingular, 
the new variables are uniquely related to the original variables x - - - n, 
and hence for given variables x, - - - , the quadratic form expressed in 
terms of the new variables has the same value. In other words, the 
quadratic form in the original variables is identical with that expressed 
in terms of the new variables, the change of variable being nothing more 
than a formal change in the notation. . 

At the same time, the transformation expressed by Eq. 16 may have 
the geometrical significance of a change of co-ordinate systems. “In any 
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case, the immediate problem is to determine the matrix of the quadratic 
form for the new variables in terms of its matrix for the original variables 
and that of the transformation 16. 

For this purpose, it is useful to recognize that Eq. 2 for the quadratic 
form may be written in matrix form. First the transformation 1 is written 
in the matrix form 

Ga] = 9] [18] 


_in which @ is the matrix of the coefficients appearing in Eqs. 1, and y] 
is the column matrix 


WM 
Ye 
y=]. [19] 
Yn 
According to the form for F given by Eq. 13, it is seen that 
F= xy] [20] 


in which #, is the transpose of the column matrix x]. Substituting for 
y] from Eq. 18 yields the desired matrix expression for F, namely, 


F= « Gx] [21] 
The expression for F in terms of the new variables x’,---2’, is now 


readily obtained by substituting for x] from Eq. 16, and recognizing that 
the transpose of this equation gives 


x= zx eC; [22] 
Thus it is found that 
F= x'C;x@ xX Cx’] [23] 
which may be written , 
F = x’ Bx’) [24] 
in which : 
B=C,xAXE [25] 


is the desired matrix of F for the new variables x’,. 

Equation 25 gives the transformation formula for the matrix of a 
quadratic form when its variables are subjected to a linear transformation 
with the matrix C. This is called a congruent transformation of @. It is 
recognized that since @ is symmetrical, the matrix 2 is also symmetrical, 
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It may be noted that the transformation expressed by Eq. 25 is very 
similar to the collineatory transformation given by Eq. 167 of Ch. III. 
The two transformations become identical if the matrix C is an orthogonal 
one, that is, if the transformation 16 relates the co-ordinates x, --- x, 
in a given rectangular co-ordinate system to the co-ordinates x’, --- x’, 
in another rectangular co-ordinate system with the same origin. This 
conclusion is recognized from the fact that for such a transformation 
C, = C! (orthogonality condition). 

Moreover, since the determinant of an orthogonal matrix has the 
value +1 (see Art. 6, Ch. I, and Art. 4, Ch. ITT) it is clear that for this 
kind of.co-ordinate transformation the determinant of the matrix & is 
equal to that of @. In other words, the discriminant of a quadratic form is 
invariant to an orthogonal transformation of its variables. 


4. THE PRINCIPAL AXES OF A QUADRIC SURFACE; THE REDUCTION 
OF A QUADRATIC FORM TO A SUM OF SQUARES; DEGENERACY AND 
RANK ; 


The quadric surface defined by Eq. 15, although central, does not 
have its principal axes (like the major and minor axes of an ellipse) 
coincident with the axes of the reference co-ordinate system (the set of 
rectangular axes on which the components 2; - - - x, of the vector x are 
projected). If they were coincident, only the square terms in Eq. 2 
would be present. The presence of the cross-product terms means that 
the principal axes of the quadric surface have arbitrary orientations in 
space relative to the directions of the reference axes. 

It is a common problem in analytic geometry to determine the orienta- 
tions of the axes of the quadric surface defined by Eq. 15 when the 
function F is given in the form of Eq. 2. If these directions can be found 
and chosen as the axes of a new co-ordinate system, a change of variables, 
which amounts to a transformation to these new co-ordinates, should 
have the effect of eliminating the cross-product terms from the expression 
for F. 

Thus the geometrical problem of finding the principal axes of a quadric 
surface is seen to be essentially the same as the algebraic problem of 
reducing an arbitrary quadratic form to a sum of squares. 

According to the vector interpretation given in Art. 2 of this chapter, 
the quadric surface is generated by all possible x-vectors whose scalar 
products with their transforms y have the fixed value unity. For any 
given direction of the vector x, its length is the distance from the origin 
to the surface. If the direction is along a principal axis, this distance is 
evidently a maximum or a minimum as compared with the distances for 
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any neighboring directions.* This fact forms a convenient basis for finding 
the principal axes, since it reduces the task to a maximum-minimum 
problem. 

The square of the length of the vector « is given by the expression 


fac]? = ay? + ay? +--+ + oy? [26] 


Since the maxima and minima of the length of x are coincident with those 
of the square of this length, there is no need to take the square root of the 
expression 26. In order that this length may represent the distance from 
the origin to the quadric surface, however, the variables x - - - %, must 
satisfy the equation 


F=1 (27] 


in which F is given by Eq. 2. 

According to the usual procedure for finding maxima, the expression 
for one of the variables obtained from Eq. 27 is to be substituted into 
Eq. 26, and the resultant function made a maximum or minimum by 
setting its partial derivatives with respect to the remaining variables 
equal to zero. Obviously, this undertaking is exceedingly awkward in 
view of the complicated form of the function F. Hence in problems of this 
sort a slightly altered procedure, known as the method of determining 
conditioned maxima (or minima), is adopted. 

Thus it is said that the function 26 is to be made a maximum subject 
to the condition imposed by the auxiliary relation 27. The procedure is 
to form the function ° 


F(a, +++ Xn) = Hy? + HQ? + +++ + 4,? — HF — 1) [28] 


in which H is an arbitrary constant (called a Lagrangian multiplier). 
In view of Eq. 27, this function is evidently the same as |x|, or rather it 
should be said that the function 28 becomes the same as |x|? when the 
condition 27 is fulfilled. 
The partial derivatives of f(«1, ++ -%n) equated to zero, that is 

of 

aa fork = 1,2,---n [29] 
yield ” equations: Together with the condition 27, these may be solved 
for the multiplier H and the special values of x, +++, which yield the 
desired conditioned maximum. 


*This circumstance is readily visualized in the case of an ellipsoid, for which all these 
distances are rea]. For the moment, the geometrical interpretation may be confined to this 
type of surface, although the results of the present discussion, as appears later on, are not 
restricted to. ellipsoidal quadric surfaces. The discussion in Art. 10 deals further with this 
question. 
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Forming the partial derivatives in Eq. 29 for the function expressed 
by Eq. 28 gives 


2x1 — Hoe =0 

Ox, 

oF 
iain id [30] 
2%n, qo = 

OXn 


If the values for the partial derivatives of F given by Eq. 9 are substituted, 
these equations become 


Oy3%y + Aye%q H+ > + AinXn = NY ; 


@g1X + Ago%q + +++ + Aonkn = Ne [31] 
Oni X1 + GnoXe + +++ + Gnntn = Mn 
in which 
1 
A= H [32] 


On the other hand, if Eqs. 30 are multiplied by %1, x2, - - - x, respectively 
and added, Eq. 10 being noted, there results 


(x4? + x7 +--+ + 4,7) —-HF =0 [33] 
and in view of the condition 27, this yields 
H = (a1? + a2? + +++ + on?) = [2/? [34] 


Equations 31, 32, and 34 together represent the solution to the maxi- 
mum-minimum problem. The results are seen to be extremely interesting, 
since Eqs. 31 are recognized as the conditions for which the direction of 
the vector x is left unchanged by the linear transformation 1, as dis- 
cussed in Art. 10 and 12, Ch. III. The results of the present problem, 
therefore, yield an alternative geometrical interpretation for these 
directions, namely: The principal axes of the quadric surface defined by 
Eqs. 2 and 15 together are those directions in space which are invariant to 
the symmeirical transformation 1. 

According to the discussion in Art. 10, Ch. III, the direction cosines 
of the principal axes are, therefore defined by Eq. 147 of that chapter, 


in which the &,, are the cofactors of the elements of the determinant in. 
Eq. 146 for the latent roots 4,. Each latent root defines a corresponding 
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principal axis, and the modal matrix £, given by Eq. 151, yields the 
directions of the complete set of ” principal axes. 

Since in the present discussions the matrix @ of the transformation 1 
is symmetrical, the modal matrix & is orthogonal, as shown in Art. 12, 
Ch. III. In other words, the principal axes of the quadric surface form a 
mutually orthogonal set. 

According to Eq. 32, there are n values for the Lagrangian multiplier 

1 

He => [35] 
corresponding to the m latent roots of the matrix @. The quantities Hy, 
He,-++Hn, which are the reciprocals of the latent roots, are called the 
proper values (eigenwerte) of the matrix @ or of the corresponding quad- 
ratic form F. Since the latent roots for a symmetrical matrix are real (as 
shown in Art. 12, Ch. IIT), it is seen that the proper values of the quad- 
ratic form F are real also. 

An interesting geometrical significance of these proper values is given 
by Eq. 34, which, for a particular vector x appropriate to the root ds, 
reads 

| = (G1? + dp? + +++ + Sn?) = |B)? [36] 
and hence represents the square of the length of the corresponding 
principal axis extending from the origin to the quadric surface. These 
lengths are commonly referred to as the semiaxes of the quadric surface. 

This result yields a geometrical interpretation to the latent roots of a 
symmetrical matrix. They are the reciprocals of the squares of the semiaxes 
of the associated quadric surface. If the latter is ellipsoidal, the lengths of 
all the semiaxes are real, and the squares of these lengths are positive. 
The quadric surface is, therefore. ellipsoidal when ail the latent roots of 
- the matrix @ are positive. The appearance of negative roots indicates 
that some of the lengths of the semiaxes are imaginary and hence that the 
surface has hyperbolic as well as ellipsoidal characteristics. If all the roots 
are negative, the surface is an imaginary ellipse, but it is customary to 
include this in the classification of completely ellipsoidal surfaces. In 
any case, the above maximum-minimum problem deals only in real 
values, since the square of the length of the vector x rather than its length 
enters into the manipulations.* 

If the principal axes of the quadric surface are chosen as a new co- 
ordinate system, and if the variables x’, - - - x’, refer to this system, the 
co-ordinate transformation from the original variables x, to the new 
vena x’, is given by the matrix equation 


x] = &x"] [37] 
*The discussion in Art. 10 is also relevant to these matters, 
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According to Eqs. 16 and 25, the matrix of the quadratic form F for the 
new variables is 


&x@xe [38] 
But by Eq. 208 of Art. 12, Ch. ITI, this is the diagonal matrix 
mm 0 0 -:--0 
A=2,xXQxf&= 


Ce 


[39] 


Hence the ae form F expressed in terms of the new variables 
x’, reads 


’ Ax’ ] 
= M071 , + Aga’ 9? + pee + Nae a [40] 
42 32 / 2 
ede ee oe let 
jee : Mien: 


Equated to unity, this is the equation of the quadric surface in the 
rectangular co-ordinate system which is coincident with the principal 
axes of that surface. It is referred to as the normal form for the equation 
of the surface. At the same time, Eq. 40 represents the reduction of the 
quadratic form F to a sum of squares. 

This reduced form for F, together with the geometrical interpretation 
of the process of reduction, now yields a basis for interpreting the sig- 
nificance of the occurrence of zero roots or coincident roots to the char- 
acteristic equation of a symmetrical matrix. Thus coincident \-roots 
evidently indicate a certain degeneracy of the associated quadric surface. 
For example, in three dimensions a coincidence of two of the latent roots 
in the case of an ellipsoid results in an ellipsoid of revolution, whereas for 
a coincidence of all three latent roots the ellipsoid of revolution degener- 
ates into a sphere. 

When two of the latent roots are coincident, it thus becomes clear from 
the discussion of simultaneous homogeneous equations in Art. 7, Ch. III, 
that the characteristic matrix (@ — AU), with » equal to the value of 
the coincident root, must have the rank ” — 2. 

This reduction in rank of the characteristic determinant is required 
because a repeated nonzero A-root would otherwise suggest that in- 
trinsically less than ” independent axes were required to describe the 
associated quadric surface. In other words, there would be less than ” 
independent directions for which the vector describing the quadric 
surface goes through extrema. It-is geometrically clear, however, that in 
the case of repeated nonzero )-roots, the quadric surface still occupies 
intrinsically 7 dimensions in space, and therefore requires independent . 
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axes to describe it. Although the symmetry resulting from the equality 
of the lengths of two or more of the semiaxes of the surface means that the 
directions of these semiaxes are not unique, it must nevertheless be pos- 
sible to pick correspondingly two or more independent (in fact, orthog- 
onal) directions along which one may assign axes to describe the surface. 
Hence if there are p repeated nonzero -roots in a symmetric matrix 
describing a quadric surface in m dimensions, it must somehow be possible 
to find exactly p independent, but not necessarily unique, solutions for the 
% corresponding to the repeated root. This fact requires that the char- 
acteristic matrix have rank m — p for the value of d in question. Under 
such circumstances the discussion in Art. 10, Ch. ITI, gives the procedure 
for finding such a set of » directions, and the corresponding nonsingular 
modal matrix £& which describes them. It is necessary to point out, 
however, that this geometrical discussion based upon quadratic forms is 
valid only for symmetric matrices, and, as pointed out in the article 
mentioned above, it is not possible to extrapolate the conclusions thus 
drawn regarding coincident d-roots to the general case of nonsymmetric 
matrices.* 

When the matrix @ of the transformation 1 is singular — more specifi- 
cally, if the rank of the matrix @ is n — p— it is possible to satisfy 
Eqs. 31 in p-independent ways for ) = 0, which means that p of the 
latent roots are zero, and the corresponding p proper values H, (squares 
of the semi-axes of the quadric surface) are infinite. Equation 40 for the 
quadratic form F then has only n — p terms, and the associated quadric 
surface is again degenerate, but in a way somewhat different from its 
degeneracy in the case of repeated nonzero roots. For example, a three- 
dimensional ellipsoid of rank 2 (the matrix @ has the order 3 and the 
rank 2) is an elliptic cylinder because one of its semiaxes is infinite. It is 
still possible to find a nonsingular modal matrix © which reduces the 
original matrix to diagonal form because of the fact that » independent 
vectors % may still be found for \ = 0. The procedure is exactly that 
described in Art. 7, Ch. III, and corresponds to the method used for 
nonzero repeated \-roots. Moreover, the converse of the present state- 
ment is also true; namely, that the occurrence of a zero \-root of order p 
in a symmetric matrix of order ” requires that the rank of the matrix be 
exactly 7 — p. The geometric reasoning substantiating this assertion is, 
again, that it must be possible to find ” dimensions intrinsically occupied 


*In general such a nonsymmetric matrix cannot be reduced to diagonal form by a col- 
lineatory transformation, since no nonsingular modal matrix may be found. An indication of 
what can be done by way of reduction may be found in G. Birkhoff and S. MacLane, A Survey 
of Modern Algebra (New York: The Macmillan Co., 1941), pp. 307-308, or R. Courant and 
D. Hilbert, Methoden der mathematischen Physik (Berlin: Julius Springer, 1931), Vol. I, Ch. I, 
pp. 36-37. 
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by the surface, even if p of them merely indicate co-ordinates upon which 
the function describing the surface is actually not dependent. 

Thus it may be stated that a given quadratic form of rank r, when reduced 
to a sum of squares, contains r terms. That is, the new variables x’; - - - x’, 
are only 7 in number whereas there are # original variables x; - - - Xp. 
This result agrees with the geometrical interpretation because the descrip- 
tion of an elliptic cylinder, in the normal form for example, requires 
only two variables inasmuch as the function describing the cylinder is 
independent of the longitudinal dimension. 


5. A RELATED MAXIMUM-MINIMUM PROBLEM 


A problem complementary to that treated in the preceding article is to 
determine the maxima or minima of the quadratic form F subject to the 
condition 


xe] = ay +a? +---+4,7=1 [41] 
imposed upon its variables. Here the function 
P(@1, +++ Xn) = F ~ Qn? + a2? +++ + a? — 1) [42] 


in which the Lagrangian multiplier is denoted by i, represents the 
quadratic form provided the condition 41 is fulfilled. 
The partial derivatives of f(x, -- - %»,) equated to zero are 


OF SON ee fork = 1,2,--- [43] 
OX% 
In view of Eq. 9, these again yield Eqs. 31. Multiplying Eqs. 43 respec- 
tively by %1, %2,--- %,, adding them, and noting Eq. 10 gives 


F — X(a1? + 42? +--+ + 2,7) = 0 [44] 
Because of the condition 41, this is 
F=hX [45] 


The conclusion is that the maxima or the minima of the quadratic 
form fF, subject to the condition 41, occur for values of the variables 
defining the vectors x whose directions are invariant to the linear trans- 
formation 1. Equation 45 shows that the corresponding maximum or 


. minimum values of the quadratic form are equal to the latent roots of 


its matrix, that is, 
Faz = Ye (s = 1,2,---n) [46] 
min 
In the present instance, the quadratic form F is not restricted to a 
constant and hence cannot be said to represent a quadric surface at all. 
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The tip of the vector y, on the other hand, does trace out a quadric sur- 
face as a result of Eq. 41 and the relation 18 between x] and y]. The 
extrema of |y|?, or the squares of the semiaxes of this surface, are given 
by the reciprocals of the latent roots of its matrix @~? (Art. 4 of this 
chapter). But, by reason of the thoughts underlying the proof of the 
Cayley-Hamilton theorem (Art. 11, Ch. III), these roots are the recip- 
rocals of the squares of the latent roots of @. Hence the extrema of ly|, 
or the semiaxes of the associated quadric surface, are just the latent 
. roots \, of Eq. 46. Thus the quadric surface associated with the trans- 
formation 1 may be alternatively thought of as the locus of the tip of the 
vector , when the tip of the vector x is restricted to lie on the surface of 
an n-dimensional sphere, Eq. 41. The semiaxes of this surface may how- 
ever be calculated in both magnitude and direction from the present 
conditions yielding the extrema of F subject to the condition 41. It is 
of interest to observe in this connection that the surface in question is 
always ellipsoidal, regardless of the nature of the nonsingular matrix @. 


6. AN INTERESTING APPLICATION OF THESE RESULTS 


In view of the results of this alternative maximum-minimum problem, 
a further particular question may be answered regarding the transforma- 
tion 1. In addition to inquiring whether it is possible for the transform of 
x to be zero, or to have the same direction as x, one may ask: Is it possible 
for the transform of x to be a vector at right angles to x? In other words, 
under what conditions is the scalar product x - y zero when neither x nor 
y is zero? 

It is shown by Eq. 14 that this scalar product is equal to the quadratic 
form F. Hence the present question amounts to inquiring whether F 
may vanish for a nonzero vector x, that is, for 


Hy? +x? +--- + 4,7 =k ¥0 [47] 


In the particular application to be considered, it will suffice to find the 
answer to this question for the special case in which the quadratic form F 
is positive definite (cannot go negative). Under such conditions, the places 
where F becomes zero are clearly minima of the quadratic form, and the 
question originally asked is equivalent to asking for the conditions under 
which a stationary point of F shall occur for F = 0.* 

*In the case where F is not a positive definite form, it is not necessary for its zeros to occur 


at stationary points (subject to condition 47). The example F = x1? — x9? illustrates the 
point in question, when coupled with the condition x1? + x9” = k # 0. F is zero only when 


k k 
xy? = x7 = af and the corresponding values of y] are y12 = yo” = a The stationary points 


of F are, however, at x2 = 0, 21° = # and x1 = 0, x9” = &. Corresponding values of F and y] 
are yo = 0, yi? = k and y1 = 0, yo” = k; while F = & and F = —k, respectively. 
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If, then, condition 47 is substituted in place of Eq. 41, the above 
analysis remains essentially unaltered. In particular, Eq. 45 is replaced by 


F=h [48] 


The answer is that F may be zero for k ¥ 0 only when one or more of 
the latent roots are zero. The occurrence of a zero root means that the 
matrix @ is singular. If at the same time Eqs. 1 are to have solutions for 
a nonzero y-vector, the latter must be a member of the transposed vector 
set of @, which is the same as the vector set of @ since @ is symmetrical. 
Although the vector x constituting a solution is orthogonal to y, it is not 
simultaneously orthogonal to all the vectors in the set of @, since y would » 
then necessarily be zero.* 

A useful application of these thoughts is to the problem of determining 
whether a given vector set is linearly dependent or independent. The 


m vectors 01, 22, °** Ym in an n-dimensional space, expressed in terms of 
their components, yield the nonsquare matrix 
V1 Vie ++ Vin 
dio. bib Senedd Waa [49] 
Umi Um2°°° Umn. 


for which it is to be assumed that m < n. 

To determine whether this vector set is linearly dependent or not re- 
quires, according to the discussion in Art. 2, Ch. III, that all m-square 
determinants be formed from the matrix 49 by selecting m columns in all 
possible combinations. The linear dependence of the vector set is estab- 
lished only if all these determinants are found to be zero. 

A considerably less laborious procedure is the following. The vector 
set is linearly dependent if there exists a relation of the form 


X1V1 + X2QV2 + eee + Xmndun = 0 [50] 


in which at least one of the coefficients x; is not zero. The latter condition 
may be replaced by the requirement that 


0? + we? +++ + Xm? = 1 [51] 


which is neither more nor less binding than to require that at least one of 
the x,’s be other than zero. 

The quadratic form obtained by forming the scalar product of the 
vector expression in Eq. 50 with itself may be written 


F = (4101 + x2 + +++ + %m%m)? = Py (0; ° %)Xite 20 [52] 


Fulfillment of Eq. 50 requires that the quadratic form F be zero. Hence 
*These matters are discussed in Art. 7, Ch. ITI. 
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if the given vector set is linearly dependent, it must be possible to find 
F = O subject to the condition 51. In other words, one of the latent roots 
of F must be equal to zero. According to the previous discussion this fact 
requires that the discriminant of the form 52 be zero; hence that 


V1 ° Uy Uz *+ Va 01° Um 
Very U_g7 Vrs? YUyB 

Ge 2 1 2 2 2 m =0 [53] 
Um V1 Um * Ve Um * Um 


Thus the vanishing of only a single m-square determinant (called the 
Gramian determinant) need be investigated in order to establish the 
dependence or independence of the given vector set. 


7. ALTERNATIVE REDUCTIONS 


From the discussion in Art. 4, it should be clear that the reduction of 
a quadratic form to a sum of squares is identical with the transformation 
of its square matrix to the diagonal form. The latter problem is discussed 
in Art. 10, Ch. II, where it is shown that when the matrix @ is sym- 
metrical, reduction of it to a diagonal matrix D is accomplished by the 
congruent transformation 


4x@x2=9 [54] 


in which 2 is a nonsingular matrix, expressible as the product of ele- 
mentary transformation matrices. 

According to Eqs. 16 and 25, it is clear that the corresponding trans- 
formation from the variables 1, -+- x, to the new variables x’; ---x’n 
is given by 

x] = Qx"] [55] 


Hence if the diagonal matrix is written 
ay 0 0 ea 0 
-0 
[56] 


then the quadratic form F, Eq. 2, expressed in terms of the new variables, 
reads 


Fe= dy yx" ;? + dont’ 9? + pear + Gand! n? [57] 


More generally, if the rank of the matrix @ (rank of the quadratic 
form) is 7, the diagonal matrix 9, after suitable arrangement of its rows, 
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has the form 
dy, 0 0 0 
0 dz O 0 
D={/0 O +++ dy +++ 0 [58] 
Gi Oe ase aac. Gate 
0 0 0 
and the reduced quadratic form has 7 terms; thus 
F e= dyyx'y? + dygn's? + +++ + dy’? [59] 


The matrix @, however, possesses no unique diagonal form. According 
to the procedure given in Art. 10, Ch. II, there are any number of trans- 
formation matrices which can take the place of 2 in Eq. 54 and also 
effect a reduction of @ to a diagonal form. For example, another non- 
singular matrix ? may be found such that 


PrxQxP =’ ; [60] 


in which 9’ is also a diagonal matrix. The transformation of variables 
leading to Eq. 60 may be represented by a relation similar to Eq. 55, 


x] = Px’) [61] 


Correspondingly, there are any number of reduced forms for Ee such as 
the one given by Eq. 59. 

In this connection it is important to note, however, that if the rank of 
the matrix @ is 7, all the diagonal matrices to which it may be reduced 
by a transformation of the form given by Eqs. 54 or 60 have the same 
number of nonzero diagonal elements (namely 7), because all the trans- 
formation matrices, 2, ?,---etc., are nonsingular. Consequently, all 
various diagonal matrices are equivalent to @, and equivalent matrices 
have the same rank (see Art. 9, Ch. IT). 

Not only do the diagonal matrices D and D’ have the same number of 
nonzero elements, but they must have the same number of positive 
elements as well. This important property may be demonstrated by first 
assuming the contrary, and thereby arriving at a contradiction. 

Since the variables x’] and x’’] are uniquely related to the original 
variables x] by nonsingular transformations 55 and 61, respectively, the 
quadratic forms resulting from the diagonal matrices D and D’ must be 
identically equal for all corresponding values of x’] and x’’]. If the 
elements of D are now written d,, and those of D’ are written d’;, the 
identity mentioned above becomes 


dy (a1)? ren dite i)? + dysi ara)? Se d,(x’,)? = 
d’, (x’",)? + eee + d’,(x’’,)? + @’ 4a (a0 41)? + a + d’.(x"",.)? [62] 
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Let the notation be chosen so that the first u terms in D and the first v 
terms in 9’ are the positive elements in these matrices. The assumption 
that one of these numbers is greater may be stated by requiring either 
p>vorv>u. It is again merely a matter of notation to assume » < 
»u(Srsn). The following special choice of variables in Eq. 62 is now 
made: 


[63] 


In terms of the original variables x] in Eqs. 55 and 61, the conditions 63 
yield v + (% — ») < mequations in » unknowns. It is then surely possible 
to find a solution for x] in which not all the variables are zero (Art. 7, 
Ch. III). Moreover, the fact that ? and 2 are nonsingular means that 
values of x’’,,,°+-2’, and x’,--+2’, exist such that neither all the «’’; 
nor all the x’; are zero. These special values of x’] and x’’] are now sub- 
stituted into Eq. 62, and the fact that d),---, d, >0 while d’,41,---, 
d’,, < 0 is emphasized by using absolute values of the latter. The identity 
62 then becomes the equation 


dy (x1)? ie d,(x',)? = [a ga] (eng)? Soe a= [d’-|(x"’,)? [64] 


The result 64 is clearly impossible, since the left side is definitely greater 
than zero and the right side cannot possibly be greater than zero. It is 
therefore necessary. to conclude that » = y, since the assumption of an 
inequality leads to the contradiction in Eq. 64. 

All the various reduced forms for F, like the one given by Eq. 59, 
therefore, have two things in common regardless of how this reduction is 
accomplished. The total number of terms is always equal to the rank 7, 
and the numbers of positive and of negative coefficients are always alike. 
This result is known as the law of inertia of quadratic forms. 

If the number of positive coefficients in a reduced form is denoted by 
P, and the number of negative ones by NV, evidently 


r=P+WN [65] 
The difference. 
s=P-—WN [66] 


is called the signature of a quadratic form. Both the rank and the signa- 
ture are thus seen to be invariant to the congruent transformation given 
by Eq. 25, in which C is any real nonsingular matrix. 

Although there are any number of matrices like 2, ?,---, etc., which 
reduce the matrix @ to a diagonal form and F to a sum of squares (ac- 
cording to the congruent transformation 54 or 60), there is essentially 
only one orthogonal matrix which effects such a reduction, namely the 
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modal matrix £& defined in Art. 10, Ch. III.* Only for this orthogonal 
matrix do the coefficients of F in its reduced form equal the latent roots 
of @. In other words, if the elements d1;, doo, - - - dn» in the reduced form 
given by Eq. 57 are equal to the latent roots \1, Ag, - -- A» respectively, 
the matrix 2 in Eq. 54 is the orthogonal modal matrix £&. 

The congruent transformation, which applies to the transformation of 
the variables in a quadratic form (as discussed in Art. 3), should not be 
confused with the collineatory transformation discussed in Art. 10, 
Ch. ITI. Since the matrix @ of a quadratic form is symmetrical, its modal 
matrix £ is orthogonal (as shown in Art. 12, Ch. ITI). The collineatory 
transformation £—'@£&, which carries @ over into the diagonal form of 
its latent roots, is then identical with the congruent transformation 
£,@£&, because the transpose and the inverse of an orthogonal matrix 
are the same... 

However, if the columns of £& are multiplied by arbitrary nonzero 
factors, the collineatory transformation still accomplishes the same 
result (as pointed out in Art. 10, Ch. III), but the matrix obtained by 
this modification of £ is no longer orthogonal,’and a congruent transforma- 
tion of @ by means of it does not reduce @ to a diagonal form and hence 
does not reduce the corresponding quadratic form to a sum of squares. 

-Thus when a collineatory transformation reduces the symmetrical 
matrix @ to the diagonal form of its latent roots, the matrix effecting 
this reduction is not necessarily an orthogonal one (as in the case of a 
congruent transformation), but may be an orthogonal matrix post- 
multiplied by an arbitrary nonsingular diagonal matrix. 


8. DEFINITE QUADRATIC FORMS 


Tf all the coefficients d,, in the reduced form for F are positive or all 
negative, it is evident that the quadratic form is positive or negative 
respectively for all possible nonzero values of the variables. This result 
is true whether the variables are the x’, appearing in the reduced form 
or the x, related to the variables x’, by a nonsingular transformation 
like Eq. 55, because this transformation amounts merely to a change in 
notation. Such quadratic forms are referred to as being either positive or 
negative definite. 

If a quadratic form is positive definite, all the latent roots of its matrix 
are positive. Conversely, if all the latent roots are positive, the quadratic 
form must be positive definite. Incidentally, it is worth noting in this 
connection that the quadric surface of a positive definite form must be 
ellipsoidal. 

*The only freedom in the construction of the matrix © occurs when there are repeated 
latent roots, and the directions of some of the principal axes of the corresponding quadratic 
form are not unique. j 
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When the quadratic form is positive definite, so that all the coefficients 
dix in Eq. 59 are positive, the further congruent (real) transformation 


at 
, uk 
0, = fork =1,2,-+--r 67 
evidently reduces F to the so-called canonic form 
Bee gl 4 oP. +02 [68] 


in which all the coefficients are +1. The requirement that F be positive 
definite is necessary in order that the transformation given by Eq. 67 be 
real. Hence the statement: A positive definite quadratic form (singular or 
nonsingular) may always be reduced to the canonic form, given by Eq. 68, 
by means of a real nonsingular congruent transformation. 


9. A CRITERION FOR POSITIVE DEFINITENESS 


If the quadratic form given by Eq. 2 is positive definite, it is readily 
seen that regardless of its rank, none of the diagonal elements of its 
matrix can be either negative or zero. For, if @,, were such a zero or 
negative element, then setting all the variables except x, and x; equal to 
zero, and letting x; = 1, would give the form the value 


B= ay; + 2ainxy + aunxr? [69] 


which, for az, < 0, can certainly become negative by a suitable choice 
for the value of x;,. Although the condition 


an > 0 . [70] 

is a necessary one for positive definiteness, it is not. sufficient. 
A useful set of necessary and sufficient conditions may be found 
through the following method of representing a positive definite quadratic 


form as a sum of squares. If the quadratic form is assumed to be non- 
singular and positive definite, the canonic form reads 


Fed ale tes +a? [71] 


and the variables x’, --- x’, must be related to the variables «1 +++ tp» 
by a linear transformation such as 


Pit + prote +++ + Pinte = 21 
Poik, + poote +--+ + pontn = x9 72] 


PniX1 + Pnoxs ee os PanXn rE, Cas 


The matrix of this transformation is restricted only by the condition 
that it be nonsingular and real; that is, the coefficients p;, must be real 
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numbers. In matrix form, Eqs. 72 read 


Px} = x] [73] 
and hence 


F= x'x’] = «9, X Pa] [74] 


Identifying this result with Eq. 21 for F . it follows that the matrix @ is 
given by 


GeP.% . [75] 


Tf pi‘, po’, +++ Pa’ represent the transposed vector set of ?, Eq. 75 
shows that the elements of the matrix @ are given by the scalar products 


Giz = pi’ > Pr’ [76] 


It thus becomes clear that the elements of the matrix of a nonsingular 
positive definite quadratic form are necessarily determined from a 
linearly independent set of vectors in a manner similar to that which 
yields the elements of the Gramian determinant given by Eq. 53. The 
sufficiency of this conclusion is appreciated from the fact that if, for a 
given matrix @, a nonsingular matrix ? satisfying Eq. 75 can be found, 
the transformation of variables given by Eq. 73 reduces the equations 
having the matrix @ (Eqs.1) to a set of identities. That is, 
PAP = U (the unit matrix of like order), and all the diagonal 
elements of U are +1. The existence of such a matrix ? is thus recognized 
to be the necessary and sufficient condition for the positive definiteness 
of the quadratic form associated with the nonsingular matrix @. 

Incidentally, it may be noted that the elements of the matrix G of the 
fundamental metric tensor (see Art. 5, Ch. ITI), are formed in a similar 
manner. The latter is, therefore, the matrix of a positive definite quadratic 
form. 

In particular, it may be observed that the elements on the principal 
diagonal of @ are given by 


One = pro Pro = | px? [77] 


These are equal to the squares of the absolute lengths of the vectors. 
According to the remarks made in the opening paragraph of this article, 
none of the vectors ;,! is, therefore, allowed to be zero. This, as well as 
the condition that the vector set ,’ be linearly independent, is assured 
by the requirement that ? be nonsingular. 

It should now be observed that the assumption of a more special form 
for the matrix f does not subject the above argument to any re: 
Thus the vector ,' in the transposed vector set of P may b 
be coincident with axis 1 of the rectangular reference syste 


eG med to%D) 
€ : Se 
S he: PE BR 7 
a y Get Tinas a 


%, 
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vector #2’ is chosen to lie in the plane determined by the axes 1 and 2; the 
third vector #3’ is then oriented so as to be confined within the three- 
dimensional subspace determined by the axes 1, 2, and 3, and so on. 
This procedure in no way restricts the generality of the vector set 
pit +++ Pa’, but merely amounts to a particular orientation of the reference 
axes relative to the given vector set p,’. 

The result of this choice of: orientation for the reference axes is that 
p;' has no components other than that on the axis 1; p2° has no com- 
ponents other than those on the axes 1 and 2, and so forth. The matrix ? 
then assumes the special triangular form 


Pir pio pis *-+ Pin 
0 peo pros +++ pon 
P=|0 O pag :-: pan [78] 


0 0 rae 0) Pan 


The real nonsingular matrix f having this form, the problem now is to 
discover the conditions which are imposed upon the given matrix @ by 
the requirement that it shall have the representation expressed by Eq. 75 
(according to which its elements are given by Eq. 76). The most direct 
’ way of discovering these conditions is to proceed with a determination of 
the matrix for a given matrix @. A possible procedure for this determina- 
tion is suggested by the method of reducing an arbitrary symmetrical 
matrix to a diagonal form by means of the congruent transformation 54 
(as discussed in Art. 10, Ch. II, and there expressed by Eq. 189). Ac- 
_ cording to this method, the matrix 2 effecting the reduction may have 
the same triangular form as # in Eq. 78 if (as is true in the present 
problem) none of the diagonal elements in the given matrix @ are zero. 
The matrix 2, moreover, has diagonal elements which are all +1. 

With reference to Eqs. 54 and 75 one may write 


a = 2°99! = PP [79] 


Since the inverse of the matrix 2 again has the same triangular form with 
diagonal elements which are all +1 (as may readily be seen from any 
method of matrix inversion, for example, the one discussed in Art. 6, 
Ch. IT), the formation of the matrix ? is made evident by Eq. 79. Thus, 
denoting by 9D’? the diagonal matrix whose diagonal elements are the 
square roots of the respective elements of D, one has 


P = Mi2g-t [80] 
- The significant part about this result is that the diagonal elements of ? 


are those of D1/?. The diagonal elements of D, therefore, are the squares 
of the diagonal elements p11, poo, -- - Pan Of P. The positiveness of these 
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squared elements is thus seen to be the necessary and sufficient condition 
for the positiveness of the diagonal elements in D, and hence for the 
positive definiteness of the quadratic form having the matrix @. It 
remains to express this requirement in the form of conditions upon the 
elements of the matrix @. 

The determinant P of the triangular matrix in Eq. 78 is equal to the 
product (p1ipe2: ++ Pan) of its diagonal elements, none of which, because 
is nonsingular, is allowed to be zero. According to Eq. 75, the de- 
terminant A of the matrix @ is 


A = P? = (pipo2 +++ Pan)” [81] 


Now if the variable x, is set equal to zero, x’, (according to the trans- 
formation 73 and the special form of # given by Eq. 78) becomes zero 
also. The quadratic form F then appears as a function of the variables 
%1°**%__, only, or in the reduced form as a function of the variables 
x’, +++ 2,1 only. The elements in the last row and column of the matrix 
@ or of the matrix ? then have no further influence upon the values of 
F in Eq. 2, so that the latter may be regarded as a quadratic form in 
n — 1 variables with a matrix @,_, obtained through striking out the 
last row and column in @. Correspondingly, the last row and column in 
may be struck out, and the remainder denoted by ?,_,. If Pn_1 is the 
determinant of ?,,_; and A»_; that of @,_1, the same reasoning as before 
shows that 

Ana = Poi? = (Pirpe2- ++ pn—1)? [82] 


In a like manner, through also setting %n_1, X%n_2, etc., equal to zero, 
it may be seen that the determinants A,_2, An_z, etc., which are ob- 
tained by striking out the last two, three, etc., rows and columns in A, 
are given by 

Ano = Pao = (pitpe2+** Pro)” 
Anz = Py_s? = (piibe2 +++ Pas)? [83] 


di? = At = uy 


Ag 
2_ 2 
pee A; 
2 _ 4s [84] 
p33" = A 
Pan? oa a 
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The necessary and sufficient conditions that a nonsingular quadratic 
form F with the discriminant A be positive definite are, therefore, stated 
by the inequalities 


A > 0, An—1 > 0,An2 > 0,---Ad = a1 >0 [85] 
A nonsingular quadratic form is positive definite if the discriminant and all 


its principal minors have positive nonzero values. 
The relations 84 together with Eq. 76 afford a method for determining 


. the triangular matrix of Eq. 78 whereby the quadratic form may be 


reduced to the canonic form given by Eq. 71. Thus the diagonal elements 
in ? are given by Eqs. 84 directly. Next, Eq. 76 shows that 


Mn = Pibie fork = 2,3,---n [86] 


from which the remaining elements in the first row of f are determined. 
Using Eq. 76 again, one finds 


Gon = PioPir + Poobor fork = 3,4,-++” [87] 


Here #; is the only unknown, and so the remaining elements of the second 
row of # can be calculated. From 


a3x = Pishizr + Posber + pasbse fork = 4,5,-+-7 [88] 


in which #3; is the only unknown, the remaining elements in the third 
row of ¥ are found, and so forth. 

If the quadratic form F is singular — more specifically, if its matrix @ 
has the rank 2 — »— then the criteria for positive definiteness are the 
same as those given by the relations 85 except that the equals sign is 
included with the first p inequalities. 

Thus if @ is the matrix of a positive definite quadratic form of rank r, 
it may be reduced to its canonic form by means of a real nonsingular 
triangular matrix 2 in the congruent transformation 


1000 0 
0100 0 
9@2=6=|0 0---1 0---0 [89] 
0 0 es 0 
0 0 0 


Here the canonic matrix & has 7 units (+1) on its principal diagonal. 
The matrix 2 has the same triangular form as # in Eq. 78. 
The inverse of 2 again has the same triangular form, and hence may be 
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identified with ? of Eq. 78, so that the singular matrix @ is represented by 
QA = 9,69 [90] 


in place of Eq. 75, which holds only when @ is nonsingular. 

Bearing in mind the form of the canonic matrix & as shown in Eq. 89, 
and applying the same reasoning as in the previous argument in which 
@ is assumed to have the rank , one may establish the above statement 
regarding the modification of the criteria 85 for any rank n — ». 


10. THE ITERATED QUADRATIC FORM 


The quadratic form which results when the matrix @ of F is replaced 
by the &th power of @ is referred to as the iterated form of kth order, or 
as the kth iterated form of F. If £ is the orthogonal modal matrix of @, 
as pointed out in Art. 11, Ch. ITI, 


£L@L = &,ALLGL = [91] 


in which A is the diagonal matrix of the latent roots of @. 
Since 


[92] 


Cc  ) 


the latent roots of the second iterated form of F are equal to the squares 
of the latent roots of F. More generally, the latent roots of the kth 
iterated form of F are recognized as being equal to the kth powers of the 
latent roots of F. 

It is also clear that the modal matrix & which transforms F to its 
normal form (sum of squares) also transforms any of the iterated forms 
of F to their normal forms, and that the latter differ from the normal 
form of F only in that the coefficients of the square terms appear raised to 
the kth power. 

It appears that all the iterated forms of even order are positive definite, 
and that their associated quadric surfaces are entirely ellipsoidal in 
character. Moreover, the principal axes of these ellipsoidal quadrics 
coincide in direction with those of the surface associated with F, since 
the modal matrix £& defines these directions. 

Hence in searching for the principal axes of a quadric surface F = 1, 
it is immaterial whether the given form F or any of its iterated forms is 
considered. With regard to the method discussed in Art. 4, it is thus 
recognized that the argument is not subjected to any loss in generality by 
the assumption, for the purpose of clarifying the geometrical visualization, 
that the given quadric surface is entirely ellipsoidal in character. 
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1]. Tx SIMULTANEOUS REDUCTION OF A PAIR OF QUADRATIC FORMS 
TO SUMS OF SQUARES 


Two quadratic forms are considered to be given by the matrix equations 
F, = x Gx] [93] 


and 
F, = x Bx] [94] 


In the present discussion, it is assumed that at least one of these forms is 
positive definite and nonsingular.* . 

The fundamental principle upon which the simultaneous reduction is 
based is the following. If one of the forms, for example Fo, is nonsingular 
and positive definite, the associated quadric surface may be visualized 
as that of an ellipsoid. The variables x, +--+ x, of the two functions Fy 
and F, are first subjected to the orthogonal transformation which intro- 
duces the principal axes of this ellipsoid as a new co-ordinate system. 
This transformation reduces Fy to a sum of squares but does not in 
general eliminate the cross-product terms from F also. 

A further real transformation of the form given by Eq. 67 next carries 
F. over into its canonic form, and the associated ellipsoid becomes a 
sphere. If F, were not nonsingular, the resulting quadric surface would 
not be spherical; and if F, were not positive definite, the transformation 
which reduces it to its canonic form would not be real. The resulting form 
F, at this stage would then not be real either, and the following step 
could not be carried out by means of a real transformation. 

The equation of a sphere with its center at the origin is evidently in 
the normal form (sum of squares with coefficients unity) regardless of 


the angular orientation of the rectangular co-ordinate axes. Any set of 


orthogonal axes can be principal axes for the sphere. Hence the variables 
in both quadratic forms can now be subjected to any orthogonal trans- 
formation without further affecting the form of F2. At this step an 
orthogonal transformation can be found which reduces F, to a sum of 
squares (transforms to the co-ordinates which are the principal axes of 
the quadric surface defined by the form which F, has at this stage), and 
Fe remains in its canonic form. 

A transformation which combines these three steps evidently accom- 
plishes the desired simultaneous reduction of both F, and F»,. This 


*In physical problems, quadratic forms usually represent energy functions, and hence this 
assumption does not constitute a serious restriction from the practical standpoint. The alge- 
braically more complicated problem which results when these restrictions are not imposed, 
is not treated here [for an introduction to this more general problem the reader is referred to 
M. Bocher, Introduction to Higher Algebra (New York: The Macmillan Co., 1927)]. 
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reduction is now formulated explicitly by means of the following matrix 
equations. 

If F, is positive definite and nonsingular, the discussion in Art. 9 shows 
that its matrix B may be represented by the relation 


B= PP [95] 


in which is a nonsingular real matrix. This matrix may be considered 
to be any one which satisfies the congruent transformation 


P I BP a OU [96] 


SU being the unit matrix having the same order as 8. 
One method of finding * is to determine first the orthogonal modal 
matrix £®) of B which, by means of the congruent transformation 


LOB = A, [97] 


reduces & to the diagonal form of its latent roots. This diagonal matrix 
is denoted by A». Now if A,//? represents a diagonal matrix with diagonal 
elements equal to the square roots of the latent roots of B (none of which 
is zero), | , 


Ay V2 OBL ay? = AU [98] 
A comparison of Eqs. 96 and 98 yields . 
P = alle o [99] 
However, since . 


in which © is an arbitrary orthogonal matrix of the same order as U and 
G, it is readily seen that P may more generally be represented as 


PY = O,A;'/22, [101] 


The matrix ? may be found by this method, or perhaps more simply 
by means of the process discussed in Art. 10, Ch. II. This part of the 
simultaneous reduction of F, and F, is not unique and may be accom- 
plished by whatever procedure appears to be the most expedient under 
the given saincelenean 

If the variables x; - - - x, in the quadratic forms given by Eqs. 93 and 
94 are a to the transformation expressed by the matrix equation 


4) = P-'z’] [102] . 


the matrices of these quadratic forms are subjected to the congruent 
transformations 


P-1G94 = ao [103] 
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and 


P ABP = y [104] 


F, is thus reduced to its canonic form. F, at this stage has the matrix 
@©), which evidently is still real and symmetrical. Hence it possesses an 
orthogonal modal matrix £¢/») which carries it over into the diagonal 
form of its latent roots; thus 


£ (aD QO QLalb) Sh [105] 


These roots are called the latent roots of @ with respect to B. Since @ is 
symmetrical, these latent roots are real. They are positive if @ (as well 
as B) is the matrix of a positive definite quadratic form (see Art. 7 of 
this chapter).* 

The further transformation of variables 


/] = Ly!) [106] 
with the orthogonal matrix &/°), leaves F, in the canonic form 
Fo = (0/1)? + (ala)? Hee + 0 n)? [107] 


and reduces F to the sum of squares 
Fy = dy!) (a0!) rg!) (20/9)? ews + An!” (ae)? [108] 
in which ),(¢/) ..-d,{¢/ are the latent roots of @ with respect to 8. 
- These are the roots of the characteristic equation 
Ia — ru] = |P-'@P — ru] =0 [109] 
Since the latent roots of a matrix are invariant to a collineatory trans- 
formation (see Art. 10, Ch. III), and 
| P,QPP A = APP = AZ [110] 
or 


PAQOP = PP 1G = BIG [111] 


it follows that the characteristic equation 109 may alternatively be 
written in either of the forms , 


|@B — ru] =0 [112] 
or | 
|[B7@ — r»~%U| = 0 [113] 
Since & is assumed to be nonsingular, the equation 
|B(BA — ~U)| =0 [114] 


*This fact is an important one in the study of oscillating systems in dynamics and electrical 
network theory. 
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has the same roots as Eq. 113. Hence the latent roots of @ with respect 
to & may also be defined as the roots of 


|@ — r18| = 0 [115] 


In other words, the characteristic equation for these latent roots may be 
written in any one of the four forms given by Eqs. 109, 112, 113, or 115. 

Since there are no restrictions on the symmetrical matrix @ (except 
that it be real, of course), some of the roots of the characteristic equation 
115 may be negative, or coincident, or zero. Observe that @ and Q 
contained in Eq. 103 are equivalent matrices (because is nonsingular) 
and hence they have the same rank. Therefore, if the rank of the matrix 
@ of fF, is r, the characteristic equation 109 or 115 has r nonzero roots, 
and the reduced form for Ff, given by Eq. 108 has r terms. The canonic 
form for F,, however, must have ” terms because the matrix of F¢ is 
assumed to be nonsingular. 


12. AN ALTERNATIVE GEOMETRICAL INTERPRETATION OF THE SAME 
PROBLEM 


So far in the discussions of this chapter, the various geometrical inter- 
pretations have assumed a rectangular co-ordinate system. Thus the 
equation F = 1 for the quadric surface associated with a given quadratic 
form is visualized geometrically by supposing the variables x, --- x, to 
be the co-ordinates of a point with respect to a rectangular system of 
axes. The normal form for this equation, for which F appears as a sum 
of squares with arbitrary real coefficients, is then familiarly recognized 
as yielding the quadric surface (ellipsoid, for example) with its orthogonal 
set of principal axes coincident with the co-ordinate axes. 

Although the tacit assumption of a rectangular co-ordinate system is 
quite in order, it is just as feasible to suppose that the reference co- 
ordinate system, which is used for the geometrical interpretations, is 
given by an oblique set of axes, but these interpretations must then be 
revised. 

First, in this regard, it is significant to observe that an ellipse, for 
example, remains an ellipse when the angle between the co-ordinate axes 
is allowed to depart from 90 degrees. The equation of the ellipse is 
supposed to remain fixed, and the co-ordinates x; and x2 are then assumed 
to be the parallel projections of a point P upon the oblique axes; that is, 
they are the contravariant components of the vector OP (O denotes the 
origin) which is the vector sum of its components according to the familiar 
parallelogram law of addition. If the given equation is plotted by laying 
off corresponding values of x; and x2 along such a set of axes, the geo- 
metrical form of the resulting figure is still elliptic provided it is elliptic 
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when identically the same equation is plotted by laying off x, and x2 
along a set of rectangular axes. 

Next, it should be noted that even if the equation of the central ellipse 
is in its normal form, the principal axes of the ellipse need not coincide 
with any of the oblique co-ordinate axes. This fact the reader may 
readily establish for himself by plotting a simple numerical example on 
oblique axes. In this connection it should be observed that the equation 
of a circle in rectangular co-ordinates represents an ellipse when plotted 
in oblique co-ordinates. 

According to the previous discussions of this chapter, the equation of 
a given central ellipse is transformed to its normal form by rotation of 
the rectangular co-ordinate system until it coincides with the principal 
axes of that ellipse. In view of the remarks of the present article, it 
appears that this reduction may alternatively be thought of as accom- 
plished through transforming to an oblique set of co-ordinate axes whose 
angular orientations relative to the given ellipse are such as to yield this 
same ellipse by means of an equation which contains square terms only. 

It is significant to note that there are an infinite number of pairs of 
oblique axes for which the equation of an ellipse goes into the normal 
form. The distinguishing characteristic of a normal form is the property 
that a reversal of the sign of any particular co-ordinate of.a point on the 
curve always yields another point on the curve. If any line through the 
center of the figure is chosen as one of the oblique axes, the other axis 
required to yield an equation for the ellipse in normal form is uniquely 
determined. The symmetry requirement mentioned above may be met 
if the second axis is chosen to be the one bisecting all chords of the figure 
parallel to the first axis. Such a pair of axes is known as a pair of conjugate 
diameters, and it is a property of these lines that each bisects all chords 
parallel to the other. It is this property which meets the symmetry con- 
dition characterizing a normal form in an oblique set of co-ordinates. A 
simple way to find the second axis, when any first axis is arbitrarily 
assigned, follows from the realization that the tangent to the ellipse at 
either end of the first axis is a limiting form for the set of parallel chords 
bisected by that axis. Hence the second axis is assigned to be that line 
through the center of the ellipse which is parallel to a tangent to the 
curve at the extremity of the first diameter chosen. 

The clue to the simultaneous reduction of two quadratic forms to 
sums of squares also lies in these ideas. Thus, if two ellipses are given (the 
two forms F and F2 may for the moment be thought of as defining a pair 
of central two-dimensional ellipses) with arbitrary semiaxes and with an 
arbitrary relative orientation, it is possible to find a single pair of oblique 
axes such that both of the given ellipses may be described by means of 
equations which contain square terms only. 
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A sketch of any two such central ellipses will convince the reader that 
it is always possible to choose a first axis in such a way that tangents to 
both ellipses, at the points where this line intersects them, are parallel. 
The second axis is then taken parallel to these tangents, and in this set 
of oblique co-ordinates both ellipses have the symmetry property which 
indicates a normal form for the equations representing them. 

The feasibility of obtaining such a set of oblique axes in the general 
case is thus indicated, though the most expedient method of handling the 
problem analytically does not directly parallel the above geometrical 
discussion. 

With reference to the Eqs. 93 and 94 for the forms F, and Fy, these 
ideas are formulated more specifically by the statement that there exists 
an oblique co-ordinate system defined by the unit vector set C1, C2, °° * on 
of the matrix C, such that a transformation to the contravariant variables 
of this system (see Eq. 43, Art. 5, Ch. III) by means of the equation 


x] = C,é] [116] 
and its transpose 
| 2, = £¢ [117] 


simultaneously carries the expressions for these two quadratic forms 
over into ; 


Fy = £CQCé] = £ Dié] [118] 


I 


and 
Fy 


I 


ECBCE] = £ Dee] [119] 


in which 9, and Dz are diagonal matrices. 

The orientations of these oblique axes, defined by the vector set of the 
matrix C, are found in the solution to the following closely related problem. 
The linear transformations associated with the quadratic forms F, and 
Fs, may be written 


Gx] = y] [120] 
and 


Bx] = z] [121] 


The first of these equations transforms a vector x into a vector y, and 
the ‘second transforms the same vector x into a vector z. The question 
may be raised whether there are any particular directions for the vector x 
(relative to a fundamental rectangular co-ordinate system) for which its 
two transforms y and z are coincident in direction (but not necessarily 
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coincident with x). This condition is expressed by the matrix equation 
y] = 3] [122] 
in which ) is a constant multiplier. 
Substituting Eqs. 120 and 121 into Eq. 122 yields 
(@ — »9B)x] = 0 [123] 
The question, therefore, amounts to inquiring whether nontrivial solu- 
tions may be found to the set of homogeneous equations 
(a4, — Nbi1)X1 + (G12 — Nbi2)%2 +++ + Gin — Nin)en = 0 
(a, — Abo), + (daz — Nbog)%2 +--+ + (don — dbon)%n = 0 


(@n1 — Nbni)*1 + (Anz — NOn2)®2 +++ + (nn — Nonn)%n = 0 
The condition for the existence of such solutions is that the determinant 
of this system of equations shall vanish, that is, 
|@ — 1B] = 0 [125] 
This is identical with the characteristic equation 115 which determines 


the latent roots of @ with respect to &. If the determinant of Eqs. 124 
for a particular root \,°°/” is written 


|@ — r»B\, 44 [126] 


[124] 


& 
and the cofactors of the ith row of this determinant are denoted by kj, 
the direction cosines of a vector x constituting a solution appropriate to 
the root \ = d,‘7/”) are given by 


s 


ky 
7 = fork =1,2,---n [127] 
Vda)? + (big)? + oe + (Rin)? 

provided al] the )- roots of Eq. 125 are distinct. 
For such a vector x with the components ¥1, %2, ° ++ Xn relative to the 
fundamental rectangular co-ordinate system, Eq. 123 may be written 
Qx] = r-/ Bx] [128] 
The 2 equations of this form, for the ~ particular vectors bk 


appropriate to the roots of Eq. 125, may be combined into one matrix 
equation by definition of the modal matrix 


fin fag ++ Pan 
f | eer | 

a ee net [129] 
Pat Pre Lan 


_ the elements of which are the direction cosines defined by Eq. 127. Then 
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the complete set of equations like the one given by Eq. 128 is contained in 
QL = BLAg [130] 


in which Ag» is the diagonal matrix of the latent roots of @ with respect 
to &, that is, the same diagonal matrix as that appearing in Eq. 105. 
_ If Eq. 130 is premultiplied on both sides by the transpose of &, it reads 


£,AL — (2:BEL) Bajo [131] 


The resultant matrices £,@& and £,BE are necessarily symmetrical 
because @ and & are symmetrical. Equation 131 implies that after the 
columns of the symmetrical matrix £,8£ are multiplied by a set of factors 
Al... rn“@!”) respectively, the resulting matrix is still symmetrical. 
This condition can be possible only if £,@£& and £,BE are both diagonal 
matrices. Hence if these are written 


&GL =, [132] 
and 


then a comparison with Eqs. 118 and 119 shows that the matrix C has 
been found, namely, | 


C=, [134] 


The direction cosines of the set of oblique co-ordinate axes, in terms of 
which the equations of fhe two quadric surfaces associated with the 
forms F, and F2 appear as sums of squares, are given by Eq. 127. More 
specifically, the direction cosines of the sth oblique axis are the elements 
in the sth column of the matrix £ defined by Eqs. 127 and 129. The set 
of unit vectors characterizing the oblique axes with reference to the 
fundamental rectangular co-ordinate system is the transposed vector 
set of £&. 

Equations 131, 132, and 133 show that 


Aap = DoD, [135] 


or, if the diagonal elements in D, and Dz are denoted by dy. and dy.” 
respectively, that. 


x, (2/9) — dg [136] 
. de” . 


The quadratic forms F; and F2 given by Eqs. 93 and 94 are, therefore, 
simultaneously reduced to 


F, = dy &,? + dog Eo? fee eh dan En? [137] 
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and 
Fo = dy £1? + dog ko? + +++ + dna En? [138] 
when the variables x, ---%, are subjected to the linear transformation 


The matrix & is, of course, not orthogonal, as is clear from the fact that 
it defines a set of oblique axes or as is alternatively made evident from 
Eq. 130, which can be written 


S(BIAL = Aap [140] 


It thus appears that & is the modal matrix which reduces the dissym- 
metrical matrix ®'@ to the diagonal form of its latent roots. 

As shown in the previous article, these latent roots are real, but if a 
has the rank r, then » — r of them are zero. Since & is nonsingular, none 
of the diagonal eleménts d,,° is zero, but Eq. 136 shows that as many 
elements of the diagonal matrix D,, are zero as there are zero -roots in 
the characteristic equation 125. Thus the reduced form 137 has r terms, 
and that given by-Eq. 138 has ” terms. 

Since F is positive definite, all the coefficients in Eq. 138 are positive. 
A further real transformation of the variables then carries Fz and F over 
* into the forms given by Eqs. 107 and 108 respectively. 


13. A FEW REMARKS REGARDING THE SIMULTANEOUS REDUCTION OF 
MORE THAN TWO QUADRATIC FORMS TO SUMS OF SQUARES 


In the solution of many practical problems, it would be very desirable 
to be able to reduce more than two quadratic forms to sums of squares 
simultaneously. To do so is in general not possible, however, as the reader 
can readily appreciate by considering an attempt to extend the reasoning 
in the opening paragraphs of Art. 11 or 12 to more than two forms. 

If p quadratic forms F,, Fo, --- Fp are given, it is, however, possible 
to reduce them simultaneously in the special case for which there exist 
p — 2 independent relations of linear dependence of the form 


yiFi + veFo + +++ + YpFp = 0 [141] 


in which at least one of the coefficients has a nonzero value. The condi- 
tion 141 must, of course, hold for all values of the common variables 
Xy °° Xn. 

Cases of this sort which do occur in practice are oe problems of reduc- 
ing the three forms F,, F2, and Fs, when 


F, = Fs [142] 
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or of reducing the four forms, Fy, Fo, F3, F4, when 

F, = Fz [143] 
and 

F; = 6F 4 


Evidently, a transformation which carries Ff, and Fs over into their 
reduced forms simultaneously yields a similar reduction for Fz and F4. 


14. THE ABRIDGMENT OF A QUADRATIC FORM THAT RESULTS FROM 
IMPOSING LINEAR CONSTRAINTS UPON ITS VARIABLES 


Consider for the moment a quadratic form F in only three dimensions, 
and visualize its associated quadric surface defined by F = 1 as a central 
ellipsoid. If one demands that one of the three variables x1, x2, x2 be zero 
(for example, if one arbitrarily sets x3 = 0), it is clear geometrically that 
the ellipsoid thereby is reduced to the two-dimensional ellipse given by 
the intersection of the ellipsoid with the co-ordinate plane normal to 
axis 3, ie., the 1-2 plane. This ellipse is defined by an equation F = 
1 in which F is obtained from F enrOuee simply dropping those terms 
involving 43. 

The equation x3 = 0 may be eine as a linear constraint which 

restricts the original variables to values corresponding to points on the 
intersection of the ellipse with the 1—2 plane, the latter being referred to 
as the constraint plane. F, which is a quadratic form in only two variables, 
is spoken of as an abridged form of F. 
. The abridged form is not always so easily found as the very simple 
situation just described. Thus, with reference to the same three-dimen- 
sional form F, suppose that the constraint plane is chosen as an arbitrary 
one, still passing through the origin of co-ordinates, however. In this case 
the equation of constraint is changed from the ra form x3 = 0 to 


P3it1 + psoxe + pasts = [144] 


If p31, p32, P33 are regarded as components of a vector 3, and 21, %2, %3 
as those of a vector x, Eq. 144 demands that the vector « be orthogonal 
for the vector p3, which, in the simpler case considered above, is coincident 
in direction with co-ordinate axis number 3. The constraint plane is 
normal to the vector p3, and may be fixed at will through an appropriate 
choice of this vector. 

The abridged quadratic form F is such a function of two variables that 
F = 1 becomes the equation of the elliptic intersection of the original 
ellipsoid with the chosen constraint plane. Since the variables involved in 
F must refer to a two-dimensional orthogonal co-ordinate system lying 
within the constraint plane, it is clear that these variables are not simply 
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two of the original ones. To find F one must first determine a new set of 
orthogonal co-ordinate axes 1’, 2’, 3’, such that one of these, say axis 3’, 
‘is coincident with the vector ~3 (normal to the constraint plane). The 
new axes 1’ and 2’ will then lie in the constraint plane; and in terms of the 
new variables #1, Z2, Z3, referring to the new axes, it becomes clear that 
the constraint 144 is expressed by the simple relation 7; = 0. The problem 
is thus reduced to the simple form considered above. The abridged form 
F is found by first subjecting the original variables in F to the orthogonal 
transformation appropriate to changing from axes 1, 2, 3, to axes 1’, 2’, 3’, 
and then dropping all terms involving 3. 

The crux of the procedure lies in finding the proper axes 1’, 2’, 3’, and 
from these the appropriate orthogonal transformation. Since axis 3’ is 
coincident with the vector 3 defined by the constraint Eq. 144, the 
problem is essentially that of associating with this vector p3, two other 
vectors p; and p> such that the three together form a mutually orthogonal 
set. To carry out this procedure, one would begin by finding first a vector 
p2 normal to ps, and then a vector p; normal to the other two. Since 
there exists an infinite number of vectors p2 normal to fs, it is clear that. 
the procedure as a whole is not unique, and there exists an infinite number 
of functions F of which any one may appropriately be called the abridged 
form of F for the stated constraint. 

The details of the procedure just described are best made clear through 
a numerical example. Suppose one has given 


F= x4? + 2257 + 3x3” [145] 


and the linear constraint 
x1 + X92 + x3 = 0 [146] 


Choosing F in the form of a sum of squares does not detract from the 
generality of the procedure to be discussed here. 

The constraint vector ~3 has the components [1, 1, 1]. A procedure for 
finding vectors p; and 2 such that the three vectors form a mutually 
orthogonal set may be patterned after the methods discussed in Art. 7, 
Ch. III. In this way the following matrix is readily found: 


5 es 
@=| 1 -1 0 [147] 
i 141 


the components of the vectors ;, p2, p3 being defined, respectively, by 
the elements of the first, second, and third rows. As already mentioned, 
the determination of this matrix # is not unique. It may readily be 
checked by inspection that the three rows of # define a mutually orthogo- 
nal set of vectors. 
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Through dividing the components of each vector by the square root 
of the sum of the squares of its components (called normalization) one 
obtains a corresponding set of unit vectors, whose matrix is orthogonal 
(defines mutually orthogonal unit vectors by columns as well as by rows). 
This orthogonal matrix, which reads 


—0.408 —0.408 0.817 
O= 0.707 —0.707 0.0 


0.577 0.577. 0.577 


yields the desired transformation from the co-ordinates x1, x2, x3 to 
Z1, Zo, Zz, thus 


[148] 


© x «] = 2] [149] 

Hence, according to discussion earlier in this chapter, the quadratic 
form in terms of the new variables has the matrix 

@=0x@x0, £150] 


where @ is the matrix of the given form. 
In the numerical example considered here, one has 


: 10 0 
@=o0x]0 2 0|xo, [151] 

| 0 0 3 
and with Eq. 148 this yields 


2.5 0.2885 0.707 | 
@=|0.2885 15  —0.408 [152] 
0.707 -0408 2.0 


Deleting the third row and column, one has the matrix of the desired 
abridged quadratic form, which reads 


FP = 2.521? + 0.57722. + 1.5%? [153] 


In a more general case in which the given quadratic form F involves 
any number of dimensions (say ~) and ¢ < m linear independent con- 
straints are specified by the equations 


prix + ProXe tise +t PrrXn =0 
Pratji% + Prgijo%2 +--+ + Prtimn%n = 0 [154] 


Pury + Pnote tees + Panta = 0 


in which r = » —i+ 1, one has given ¢ vectors ,, $,41,-++ pn» with 
components defined respectively by the coefficients in these equations 
taken by rows. The ¢ planes orthogonal to these vectors are the constraint 
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planes, and the desired abridged quadratic form should have its variables 
confined to lie in the intersection common to these planes. 

Geometrically it is now convenient to think of the ¢ independent vectors 
Pry Prtiy +++ Pn as occupying a #dimensional subspace immersed in the 
n-dimensional one. The common intersection of the constraint planes 
then occupies a (x — ¢)-dimensional subspace, and these two subspaces 
are mutually orthogonal to one another. One must find a new orthogonal 
set of co-ordinate axes such that the first (~ — #) of these lie in the 
(n — t)-dimensional subspace, while the remaining ¢ of them lie in the 
t-dimensional subspace occupied by the vectors ~;, Pr41,*** Pn- In this 
new co-ordinate system the constraint equations 154 become simply 
E, = E41 = +--+ = E, = 0, and the variables %, Ze, - + - Z,_~ must lie in 
the common intersection of the constraint planes because these lie in the 
(n — t)-dimensional subspace. 

The first step in the process of finding an appropriate set of new co- 
ordinate axes is to determine a mutually orthogonal set of ¢ vectors that 
occupy the same subspace as the constraint vectors ~;, Pr41,°* Pn. If the 
latter happen to be mutually orthogonal, this first step is already done 
(as may be said of the single constraint case discussed above), but in 
general the constraint vectors are any independent set, not necessarily an 
orthogonal one. 

A mutually orthogonal set of vectors occupying the +dimensional 
subspace is any set of ¢ mutually orthogonal vectors each of which is 
expressible as a linear combination of the given constraint vectors. Such 
a set may be formed in an infinite variety of ways. One may begin, for 
example, by choosing #, as the first of the desired set. Then one may 
determine a second vector, orthogonal to p,, and expressed as a linear 
combination of p, and ~,,1. Next one determines a third vector through 
demanding that it be orthogonal to the first two already found, and 
expressed as a linear combination of $,, p41, Pr+2, and so forth. 

Specifically this process may be indicated as follows, letting the desired 
mutually orthogonal vector set be denoted by 9, Br41,°** Dn. To begin 


with, one chooses p, = p,. Next 

Dr as ap, + BPr. 
me ig] : ig [155] 
Dr Dros = apy Pr + Bir: Pri = 0 


in which a and @ are nonzero. Here 8 may be chosen arbitrarily. Thus 
letting 8 = —1 gives 
Pr v Pri 


a as 
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and thus 9,41 is found. Then one writes 
Proo = a! Pe + B’Prgr + y' Proto 
Dr: Drte = 0 [157] 
Pri’ Protez = 0 


in which y’ may be chosen at will and the last two equations solved for 
a’ and @’, thus determining 7,4». | 

In the next step one will have to solve three equations simultaneously 
for three unknown coefficients, etc. If the number of constraints is large, 
the computations may become tedious, but they remain straightforward. 
There are, of course, other ways in which the desired set of ¢ mutually 
orthogonal vectors j, - - - By, may be found, the method given here being 
rather simple in principle and no more tedious computationally than any 
other. 

One must now associate with these ¢ vectors, 7 — ¢ additional ones so 
as to obtain a complete set of m mutually orthogonal vectors 9; - - : Bn. 
This second step is carried out as in the single constraint case. The 
directions of these » vectors are those of the desired co-ordinate axes. 
The orthogonal transformation from the original co-ordinates to these 
new co-ordinate axes has a matrix © whose vector set is p,/ (pil, B2/ Bol, 
-++Bn/|Gn|; that is, a set of unit vectors coincident with p,--- Dn. 
Equation 149 expresses the co-ordinate transformation, and Eq. 150 
gives the matrix of the quadratic form F in the new variables Z;, in terms 
of the matrix of F for the original variables x;,. In terms of the new 
variables 


“F= 2 X@xX 4] [158] 


and the arbitrary linear constraints, expressed in terms of the original va- 
riables by Eqs. 154, are simply 7, = 3,41 =-:: = E, = 0. The desired 
abridged quadratic form is that part of F remaining after terms involving 
the last ¢ variables are discarded. 

Perhaps it might be well to restate what has been done above in another 
way. Specification of the linear constraints 154 is equivalent to demanding 
that the vector x with components x, - - - x, is no longer free to assume 
any orientation in space, but is required always to be simultaneously 

. orthogonal to ¢ arbitrary and independent vectors p, - - - Pn. Now these ¢ 
vectors occupy only ¢ of the ” available dimensions of the given space. 
Hence, so long as the vector x moves about so as to stay outside the ¢ 
dimensional subspace occupied by the constraint vectors, it will fulfill 
the stated orthogonality restriction. For example, for » = 3 and ¢ = 1, 
the vector x is free to move in a plane normal to the single specified con- 
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straint vector; for 2 = 3 and ¢ = 2, the vector x is restricted to the one 
remaining dimension, defined as that direction in space which is orthog- 
onal to the plane (é-dimensional subspace) determined by (or occupied by) 
the two constraint vectors. 

When 1 is larger than 3, one must be mentally sufficiently adaptive in 
one’s thought to comprehend the analogous geometry implied by con- . 
tinuing the identical algebraic reasoning beyond = 3. For example, with 
n = 4and¢ = 2, one must visualize a two-dimensional plane determined 
by the two constraint vectors as defining a corresponding two-dimensional 
subspace, and recognize that there are two other dimensions left over 
which define another subspace (the  — ¢ dimensional one in this case) 
orthogonal to the plane determined by the constraint vectors in the same 
sense that if the vector x remains in this second two-dimensional subspace, 
it remains simultaneously orthogonal to the stated constraint vectors. 

Stated in terms of constraint planes, which are normal to their respec- 
tive constraint vectors, one must recognize that the (m — ¢)-dimensional 
subspace to which the vector x is restricted should be interpreted as a 
resultant common intersection of these planes. Again for » = 3 and 
t = 2, the two constraint planes intersect ina line, so that only one 
dimension is left for the vector x to exist in. However, for » = 4 and 
t = 2, the “intersection ” of the two constraint planes becomes a two- 
dimensional subspace. In general, one must visualize the possibility of é 
planes in an #-dimensional space as possessing a common intersection 
which is (n — ¢)-dimensional, the latter defining a subspace orthogonal 
to the one occupied by the constraint vectors. 

What one is asked to do in the abridgment process is to find z mutually 
orthogonal unit vectors in the »-dimensional space such that ¢ of these are 
linear combinations of the constraint vectors. These ¢ vectors then clearly 
occupy the same subspace as the constraint vectors. Since the remaining 
(n — ¢) vectors are simultaneously orthogonal to the first ¢, they must 
be simultaneously orthogonal to the constraint vectors, and hence they 
define the (x — ¢)-dimensional subspace in which the vector x can move 
and still conform with the restriction that it be orthogonal to all the 
constraint vectors. 

If one chooses new co-ordinate axes coincident in direction with this 
set of mutually orthogonal unit vectors, the matrix of the linear trans- 
formation, expressing the co-ordinates with respect to the new axes in’ 
terms of those relating to the original axes according to Eq. 149, is clearly 
that orthogonal matrix having these unit vectors as its vector set. In 
terms of the new co-ordinates the original constraints are equivalent to 
setting equal to zero those variables that refer to the axes lying within 
the subspace occupied by the constraint vectors, because these simplified 
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constraint equations clearly demand no more nor no less than the original 
ones. 

While there exists an infinite number of possible sets of new co-ordinate . 
axes fulfilling the conditions just stated, note that the two subspaces 
defined by such axes are unique. For example, with » = 3 and # = 2, any 
co-ordinate system of which two axes lie in the plane determined by the 
constraint vectors (the é-dimensional subspace) is acceptable. This plane, 
however, is fixed, and so is the remaining direction normal to it (the 
n — t dimensional subspace) no matter which one of the infinite possible 
choices one makes in determining a specific set of new axes. 

In summary one may say that the discussions of this article show how 
one can convert an arbitrary set of # linear constraints, such as those 
expressed by the Eqs. 154, into an equivalent set having the simple form 
of demanding that ¢ of the variables be zero. The latter will, of course, 
not be the original variables but new ones related to the original ones by 
an orthogonal transformation. In the next article the object is to in- 
vestigate the effect of applied constraints upon the latent roots of a quad- 
ratic form. Since these latent roots are unchanged if the variables in the 
quadratic form are subjected to an orthogonal transformation, the form F 
has the same latent roots when it is expressed in terms of the new variables 
%, ++ E, as it does when it is expressed in terms of the original variables 
’ 4%, +++ 4p. One may therefore say that the effect upon the latent roots of 
F caused by imposing an arbitrary set of ¢ linear constraints may be 
studied without loss in generality by considering only the simple case in 
which the constraints have the form expressed by setting ¢ of the variables 
equal ‘to zero. 


15. THE EFFECT OF CONSTRAINTS UPON THE LATENT ROOTS OF A 
QUADRATIC FORM 


In this discussion only the absolute values of the latent roots are of 
interest. When the quadratic form is positive definite, these roots are all 
positive real numbers, and there is no need to emphasize specifically the 
fact that only their absolute values are to be considered. It is only in the 
more general case in which the latent roots may have negative as well as 
positive real values that such a distinction is necessary. However, since 
the discussion in Art. 10 regarding the iterated forms shows that the 
iterated form of even order can have only positive latent roots (which are 
those of the original form raised to an even power), it is clear that, 
although the present discussion be restricted to the consideration of 
positive definite forms, the conclusions reached apply equally well to the 
magnitudes of the latent roots in the general case. 
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In Art. 5 it is shown that the latent roots \1, \2, °** An of the form 
F= YD auxity, [159] 
ikea 


may be regarded as the extrema of F subject to the restriction of the x,’s 
expressed by 


Sa? a4 ae [160] 


The correctness of this result. may be made evident through considering 
the form F reduced to the normal form 


F = )y3y? + AgEo? + +++ + MnEn? [161] 
by means of the orthogonal transformation 
x] = £2] [162] 


involving the modal matrix &. Because of the orthogonality of the latter 
transformation, the condition 160 is unchanged in form, that is, 


X72 =1 : [163] 
k=l 
In terms of F as given by Eq. 161, and the condition expressed by Eq. 
163, it is clear by inspection that the latent roots are extrema of F. More 
specifically, if the roots are numbered in such a way that 


My > Ag > Ag D> > Nn [164] 


one may see that the largest value of F for the condition 163 occurs for 
Z, = 1, i = 73 = ++: = E, = O, and equals dy. The next largest sta- 
tionary value of F under the same condition occurs for %, = 1, %, = %3 = 

+ = %, = 0, and equals \,; and so forth. 

If the variables x, ---%, in F are subjected to an arbitrary set of ¢ 
independent linear constraints, the resulting abridged form F in n — t 
variables has n — ¢ latent roots which may be numbered so as to conform 
to the sequence 


Rt > Xe > Ng > > Nat [165] 


It is the object of the following discussion to establish relations between 
the magnitudes of the latent roots of F and those of F. 
The constraint equations, which are assumed to have the form - 


uy = Wy =m = = 4, = 0 [166] 


are alternately expressed in terms of the variables %, (related to the x; 
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through the orthogonal transformation 162) as 
p>) Ponte = O (s = 1,2,--- t) [167] 


in which the f,, are elements of &. 


Although F, subject to the linear constraints, has latent roots that are 
different from 1---n, nevertheless the expression 161 for F may be 
used to compute values of F corresponding to any values of the variables 
&,---%,, and hence yields values for the abridged quadratic form pro- 
vided only that the assumed values for %, - - - , conform to the constraint 
equations 167. If in these equations one chooses to let 7:42 = Tip3 = 

-+ = £, = 0, there results a set of ¢ equations in ¢ + 1 unknowns, which 
surely possess a nontrivial solution for %, - - : #41 in agreement with the 
condition 163. For such a set of %,-values, Eq. 161 yields 


F = YyBr2 + obo? ++ + Acq Fegs? S Mega (Br? + ++ Begs?) = Mer [168] 


as may be appreciated by noting the inequalities in 164 and the condition 
163. ; 
Since F, subject to the linear constraints 166 or 167, is the abridged 
form F, the result 168 shows that a possible value of the abridged form is 
at least as large as the latent root 4, of the corresponding unabridged 
form. If the maximum-minimum problem discussed in Art. 5 is applied 
to the abridged form F, one observes that the maximum value of F, 
subject to a condition in terms of its variables similar to 160 or 163, is 
X,. The next largest stationary value is X2, and so forth. Inasmuch as it 
has been shown that a possible value for F is at least as large as 4,41, it is 
clear that the largest value x1 is surely as large as this, that is, 


Xi 2 Ae [169] 

Suppose now that v additional constraints are imposed. upon F, making 

t+ constraints in all. The resulting abridged form, which may be 

denoted by F, can also be regarded as resulting from imposing the set of 

v constraints upon F. If the latent roots of F are denoted and numbered 
according to the sequence 


Ai Ng he ee [170] 
then one may write two additional relations similar to 169 which read 
Ay E he [171] 

and 7 
M1 2 Not [172] 


In any one of the relations 169, 171, or 172 the equals sign holds only 
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if the constraints are chosen in a particular way. Thus for a particular 
form of the v constraints, the equals sign in 172 may be assumed to hold, 
but it will not simultaneously hold in 171 also as long as the original ¢ 
constraints are considered to be arbitrary. Thus the relations 171 and 172 
are seen to yield 


Nop S Mepops [173] 


which establishes relations between all the latent roots of F and a like 
number of those of F, since the integer v can have any value from 0 to 
n—t-1. 

Considering again the interpretation of the latent roots d; - ++ Xn_z of 
F in the manner that those of F are interpreted in Art. 5, bearing in mind 
that F is the result of imposing certain constraints upon F, and hence that 
all the extrema of F are smaller than the respective ones of F, which are 
attainable only if its variables are free to assume particular sets of values, 
one recognizes the following additional relation as being true: 


Ack (s = 1,2,---"—2) [174] 
Together with 173, one may summarize the results so far in the form 
As Z Xs ZS Date [175] 


in which the index s may be given the integer values 1, 2,--- 2 ~t.A 
useful relation between the » — ¢ latent roots of F in terms of the n 
latent roots of F is thus established. 

A case of particular interest is that in which a single constraint is im- 
posed upon F. The latent roots of F and F are then related as expressed by 


Ay 21 ZS Ap SA SAB Sse S Ant Sn - [176] 


These inequalities are sometimes referred to as expressing the separation 
property of the roots of F and F. 

An interesting geometrical view of this last result is had through 
visualizing the ellipsoid associated with F, and the constraint plane as 
slicing the ellipsoid centrally at an arbitrary angle. The intersection of the 
ellipsoid with the plane yields an ellipse with principal axes whose lengths 
are clearly intermediate as compared with those of the ellipsoid. 

With reference to the pair of forms F', and F2, discussed in Arts. 11 and 
12, it may be pointed out that the results of the present article apply also 
to the roots \‘°/®) and those of an abridged pair of forms resulting from 
subjecting their common variables x;---x, to an arbitrary set of 
independent linear constraints. 
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PROBLEMS 


1. Write down the matrices of the following quadratic forms and determine whether 
or not they are singular. 


Fy = 5x1? — 8xixe + 3x3 + 4a, + 2x9? + 4xo%3 — 2xox, + 5x37 + 2x3x4 + 6x4? 
Fe = 8x? + 3x09 + 7x1%3 + Oxex1 + 9x2? — 3xoxg — 3xgx1 + Sxgve + x9? 


2. Show that the matrix of a quadratic form can be written as 


1f oF ; 
¢ = 5[ 1,k = 1,2, “Hn 
3. Transform congruently the matrices 
4 2 1 0 
2 64 -1 Be iets ae 3 -1 
3 10 -1 
1 4 8 3 2 -1 45 —-i1 1 
0 -1 3 10 
respectively with 
53 41 ee { _v3 
3 8 1 1 3 5 7 2 2 
0 3 5§ 0 2 3 1 V3 1 
12 -1 4 3 


4. For what type of transformation matrix is the collineatory. transformation 
identical with a congruent one? Out of the following matrices, pick by inspection 
those that have this property: 


0 0 ek S22 

V5 3 3V5 
21 -1 1 _v3 ae ae S <i 
#55 ad 2 2 gee eee 1 4 1i- 
| 2 V5 3 3V/5 Pear 4 

10 3 V3 1 

0 — = 2 V5 

2 2 0 - a 

3 3 


Check your selection by carrying out the transformation upon the matrix 


5 3-2 
3 10 -1 
—2 -1 15 


5. Compute the latent roots and proper values of the following quadratic forms: 
Fy = py[83x1? + 52xize — 40x1xg + 122x0? + 20xex3 + 65x37] 
Fe = qg[21x1? + 24aixze + 120% 1x3 + 39x27 — 6Oxex3 + 30x37] 
Fs = Ps[49x12 — 4aize — 80x13 + 46x27 +- 40xex3 + 40x57] 
Fa=x2+ 2V 3x1" — x2? 


6. Find the lengths of the principal semiaxes of the ellipsoid represented by the 
quadratic form: 
F = 177x12 + 228x1x%9 — 120x1x%3 + 348x52 + 60xexr3 + 105432 = 45 


176 QUADRATIC FORMS [Ch. IV 


7, Compute the modal matrix of the quadratic form given in Prob. 6 and make an 
isometric plot showing the positions of the principal axes. 


8. Show that the quadratic surface 
2x42 + 5x9? = 6% 0%3 + 5x3? =2 


represents an ellipsoid of revolution. Compute the lengths of the semiaxes and find 
their direction cosines with reference to the given co-ordinate system. Write down the 
modal matrix. Is it unique? 
9. Reduce the following quadratic forms to sums of squares through orthogonal 
transformation and give the respective transformation matrices: 


Fy = 3.501? — 2xieqg + x13 + 1.6x0? + O.4xex3 + 1.925? 
Fe = z¥y[13001? — 284x1vo — 188xixg + 181x207 — 256xerg:— Sag] 
F3 = —2.6x1? + 2.441%2 + 0.6x2 
Fg = 0.4(xy? + WV 32102 + 2x9?) 
10. Reduce the following quadratic form to its normal form: 


F = 0.487x1? — 0.784xix2q + 0.632x%1%3 + 0.445x144 + 0.48722 + 0.632x0x9 
+ 0.445x9%4 + 1.8923? +- O.111442 


What is its rank? 
11. Find the orthogonal matrix which transforms 


F= 0.392%? = 0.94821%2 = 0.632%1%3 + 0.44441%4 + 0.392%? a 0.632x9x8 
— 0.444x0%4 + 1.892%? + 0.111042 


’ to the normal form. Find the latent roots and the signature. 
12. By forming the Gramian determinant, establish the linear dependence or 
independence of the vector set defined by the row matrices: 


134513] 234567 1 68 10 -3 2] 


Repeat for the set 
[1 -1 4 8 2] [3 —3 -2 -1 0] [4 25 8 -8] 


13. By means of elementary transformations find a matrix 2 which will reduce to a 
sum of squares the quadratic form 


F = 4x)? + Saite — $aixg + 7x0? + Broxs + 2x3? 


through a congruent transformation of its matrix @. Determine the corresponding 
diagonal matrix D1-and the signature. 
Through a similar procedure find another matrix P which also yields a reduction to 
a sum of squares. Express the relation between the two diagonal forms D and D’ of 
the reduced matrices by computing the diagonal matrix A in the equation 
D’ = A,DA. Show that it is always possible to find such a real A relating the alternate 
diagonal forms of a symmetric matrix. 
14, Using the following vector set, 


(11-1) [-3 3 5] [-5 0 §] 
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form a matrix @ whose elements are computed as are those in the Gramian determi- 
nant. Find the latent roots of @ and check the positive character of the principal 
minors of its determinant. 


15. If possible, reduce each of the following quadratic forms to its canonic form 
and give the corresponding modal matrix as well as the transformation required to 
pass from the normal to the canonic forms. 


Fy = (83x12 + 52ayq — 40x1%3 + 122x2? + 20xex3 + 65x57]45—1 
Fo = [(49x1? — 4ayxre — 80xixg + 46x29? +-40xer3 + 40x57]45—-} 
16, Determine whether or not the following quadratic form is positive definite: 
F = 3x1? — 4ayxe — 4xyxg + Sx? + 2xexg + 4x? 


17. Find the triangular matrix P that generates the matrix @ of the quadratic 
form 


F =2;?—6x 1x2 — 2x13 +40 1744+13x9?+10xrexg —20x2%4-+11432 — 14034425042 


and check the relation @ = 9,f. 
18. (a) Find the extrema of the quadratic form 


F = 4[-721? — 10V 3x12 + 3297] 


when the variables x1, x2 are subjected to the condition that the point P(x1,x2) shall 
lie on the unit circle. 
(b) Find the extrema of 


F = d[4ai? + 4ayre + 4eixvg + 149x202 — 278x0x3 + 149257] 


subject to the condition that P(%1,x2,73) shall lie on a sphere of radius 2. Determine 
the signature of this quadratic form. 

19. Let @ be the matrix of a positive definite quadratic form, A its discriminant, 
and ¥ a triangular matrix which generates @ according to the relation @ = P,P. 
Show that a possible procedure for the formation of f is given by the relations: 
prj = O for k > 7; dey HO for k Sj. 


Pipi pe Ge N, 

= i= i= 

: Va V a11Bi2 V BisCi28 
Ki Ke—17 VA 


Pug = Pan = So 
V Jey Kink V Mi...(¢n—1) 
Here a1; are the elements of the first row of @. The Bi; are second-order minors 
formed from the first two rows of @ by selecting the first and the jth columns. The 
Cis; are formed from the first three rows of @ by selecting columns 1, 2, and j; and so 
forth. 
20. By using the procedure given in the preceding problem compute a triangular 
matrix ? which generates the matrix of the quadratic form given in Prob. 17. 
21. Show through a proper expansion of the characteristic determinant of a 
matrix that its characteristic equation can be written in the form 
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(-d)" + (-A)" > (principal diagonal elements) 
+ (—d)*-* Dd (principal minors of order 2) 
+ (—d)"-3 ¥ (principal minors of order 3) 


—dD (principal minors of order n — 1) 
+ determinant = 0 


In view of this result, show that the sum of the latent roots of a matrix equals the 
sum of its principal diagonal elements, and that the product of the latent roots equals 
the determinant. Show further that if the determinant is zero, at least one latent root 
is zero; if the matrix is of rank » — 2, at least two latent roots are zero; and so on. 

22. Using the result of the previous problem, show that the roots of the equation 
XN + ag_aA"! + +++ + aid + ao = 0 are the latent roots of the matrix 


0 1 0 0-:- 0 O 


0 0 1 0 - 0 90 

0 0.0 1 0 oO 

0 0 0 0 1 
—do —ay —d2 ene eee —An—-1 


23. Let @ be an nth order square matrix and \j, Ag,-~- An its latent roots. The 
mth power of @ is denoted by B. . 

(a) If the latent roots are real and distinct, and numbered so that |Ail > |Aa| > 
+++ > |dal, show that for a sufficiently large m: 


[Aa|™ = 3% (principal diagonal elements of B) 
[Adal & XO (principal minors of order 2) 


Pe oe 


(b) If the latent roots are real but there are a repeated roots so that 
ra] > [Ao] > +++ > Prol > [otal = otal = +> = otal > otaral > +++ > [al 


derive the relations 


jim DX (principal minors of order j) 
7 


~ 2 (principal minors of order j— 1) PN ee Bere bh kanes Sem 


(principal minors of order g-+1) 
a@ & (principal minors of order g) 


rogal™ee 


(c) Suppose, in the sequence |A1| > |A2| >+++ > [Anl, that the first 4 — 1 roots 
are real but that the roots d, and. A,41 are conjugate complex with the angle @,. In 
this case show that 


bral? x ZX (principal minors of order 4 +.1) 
‘= (principal minors of order # — 1) 


and 
¥ (principal minors of order /) 


2 Vr, | ~ FS nal minore afanderk 1) 
|An|" cos mp, & > (principal minors of order # — 1) 
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24. If F is a positive definite quadratic form in x variables, show that its character- 
istic determinant, when expanded as a polynomial in X, consists of » + 1 terms with 
' alternate algebraic signs, and that the same polynomial with all terms alike in sign is 
associated. with a negative definite form. 


n 
25. Consider the quadratic surface 2° a,«;x, = 1 and the plane 
eet 


ry 
n 
~ Aner, = 


(a) Show that the intersection of the plane with the surface can be expressed as a 
quadratic surface of » — 1 dimensions in the form 


n—1 
> Drgat it" a =1 
rs =1 
and compute the values of the coefficients 6,>. 
(b) As a numerical illustration find the equation of ead ellipse given by the inter- 
section of the ellipsoid of revolution 


ge aie eV 2, Bad 
a b 


and the plane 
—2x1 + 3x2 + x3 =0 


Find the lengths of the semiaxes of this ellipse and compute their direction cosines. 
26. Given the quadratic surface 


n 
Laux, = 1 
i,k=1 


and the parametric equation of a straight line 
xj =a,t + B; J=1,2,---n 


in which a; and 8; are constants and # is a variable parameter. 
(a) Demonstrate that the line cuts the surface at most in two points and that the 
corresponding parameter values must satisfy the equation 


y cucu) 242 % auaiBs)! =1- 


3k =1 i,k = 


p> aunBBr 


1 


7 


Discuss the conditions under which there are 2, 1, or no points of intersection. 

(b) Compute the co-ordinates x’;, and x’, (& = 1, 2,+++m) of the points of inter- 
section. 

(c) If the straight line passes through the origin, show that 


ak 


xk = aly = 
n 
\ x DjfpOjOk 
ik=1 


tka 


and interpret this result according to whether the quadratic form is positive definite or 
not. 

‘ (d} Write down the corresponding expressions appropriate to a quadratic form 
representing a cone. 
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27. Carry through the simultaneous reduction to the normal form of both quadratic 
forms in the following pairs: 


(i) Fy = 2x?"+ 2xize + 49? Fe = x1 + 2xrve — xo" 
(ii) Fy = 2x2 + Qxywe + 2eyg + 3x9” — Qxars + 2x3? 
Fq = 1? + 3x9? — 2xex3 + 2x9? 
(iii) Fy = 201? + 6x1xe + 5x2? = Fe = 2xixe 


28. Compute the matrices ©”, A,”?, and P (see Eq. 101), and the matrices CK) 
and £@! for each of the three reductions in Prob. 27. 

29. Let a11%12 + aea%e? = 1 be the equation of an ellipse when referred to a system 
of oblique axes making an angle ¢. 

Show that the principal axes of the ellipse make angles a1 and a2 with the x1-axis 
that are determined by the relation 


(m? —1) + V (m? — 1)? + 4m? cos? 


tanai.2 = 
2 (nt 41) + V (mn? + 1)? — 4m? sin? 
in which m = a3;/a22; and that these axes can have any desired orientations through 
the proper choice of the ratio m. Observe that the angle between the axes is 90°. 
Write down the expression for a family of ellipses for which one principal axis 
coincides with the xe-axis. : 


30. Consider the pair of quadratic forms: 
Fy, = 3x? — QW 2x 1x09 + 2x9? and Fo = 4? + WwW Ix1H"2 — 2x9? 


Regard the variables x; and x2 as the contravariant co-ordinates of a point P referred 
to a system of axes for which the fundamental metric tensor has the matrix 


ft 
e-[i i 


(a) Make a plot of the curves F; = 1 and Fy = 1 in this system of reference 
co-ordinates. 

(b) Find the transformation of variables which simultaneously reduces both 
quadratic forms to their normal forms. 

(c) Find the orientations of the new co-ordinate axes relative to the old ones. 

(d) Compute the transformed fundamental matrix g. 

(e) Find the orientations of the principal axes of the resultant ellipse with respect 
to the new co-ordinate system. 


31. Consider the pair of quadratic forms 
Fy = 212 — 2xyxq + 2x 3xg + 2xe? — 2xex3 + 3x5? 
Fo = 3x12 — 2xyxq + 2xix3 — x2? — Qxoxg + x3? 


in which x1, x2, x3 are contravariant variables with respect to an oblique co-ordinate 
system consisting of a triad of axes whose mutual angles are all 60°. 

Reduce both F, and Fz to their normal forms through a transformation to a new 
co-ordinate system and find the orientations of the new co-ordinate axes with respect 
to the old ones. 
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32. Find the equation of the intersecting curve between the quadratic surface 
83x17 + 52xize — 40xixg + 122x202 + 20xexg + 652g? = 45 
and the plane 
5x1 + 4x_ — 3x3 = 0 
and compute the lengths of the semiaxes of this curve as well as their orientation with 
respect to the assumed rectangular Cartesian co-ordinates. 


33. For the ellipsoid given in Prob. 32 find equations of the three planes containing 
the principal axes taken in pairs. 


34, Find the maximum perpendicular distance from points on the ellipsoid of 
Prob. 32 to the plane __ : 


—2x1 + xa + 2x3 = 0 
35. Given 
F= gs{21x1? + 24xixe + 120K 1x3 + 39x—? — 6Oxexs + 30x37] 


Find the abridged quadratic form that results if the variables are subjected to the 
condition that they determine a point which is constrained to lie on a plane normal to a 
vector with the components 1, 2, —1, and compute the latent roots of this abridged 
form. 


36. The rectangular Cartesian co-ordinates forming the reference system for the 
ellipse 
3a? + 2xe? + x3? = 1 
are subjected to a transformation to a new set of axes defined by the orthogonal vector 


set 
abs 3 al Ez 1 -A| [0 2 V5 
V5 3 3V5 V5 3 3V5 33. 
Find the equation of this ellipse in terms of the new variables and make plots in the 


new co-ordinate planes of the intersections of the ellipse with these planes. 
37. The real quadratic form 


n 


F= 2 ance 
ik= 
is subjected to the set of ¢ linear constraints 
n 
LD Prete = 0 r=i,2,--+t 
K=1 
and the condition _ 
n 
>» x, =1 
g=1 
In terms of the function 
n t n : 
F=G=F+XM1- >> nt) +2 >» DX MePraXe 
k=1 r=lk=1 


in which \ and py --- i are Lagrangian multipliers, show that the conditions fcr en 
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extremum of F subject to the constraints lead to the equations 


n i , 
2 aunts — te + Ze bebe = 0 (& = 1,2,--+#) 
t= r= 
i 
DY puxe = 0 (ry =1,2,+++t) 
k=1 
Show that this system of equations is sufficient to determine the unknowns 


4°¢* Xn, Mit ** pe, A, and that the extremum of F is given by F = ) in which the 
A-values are roots of the equation 


(a1, — A) G12 Rohe din putt: pe 
a21 (@a2 — dr) - an pis pre 
Ant Qn2 idee (Gun - d) Pin Pin 
Pu pis sn Pin O-- 0 |=0 
par pre Pin 0 0 
pur Die Pin 0 0 


38. Show that the determinant given in Prob. 37 furnishes the same latent roots as 
those corresponding to the abridged quadratic form. Following a method similar to 
that given in Prob. 21, obtain expressions for the coefficients of the various powers of 
d in this characteristic equation. 

39. Illustrate the procedure outlined in Prob. 37 with the quadratic form and the 
single constraint given in Prob. 32 and compare the results with the solution to that 
problem. 


CHAPTER V 


Vector Analysis 


1. PRELIMINARY REMARKS AND DEFINITIONS 


The quantities considered in this chapter are functions of the co- 
ordinates of ordinary (three-dimensional) space. In some cases, they also 
may be functions of other independent variables such as the time. 

A scalar is a function which, for each set of values of its independent 
variables, is completely characterized by a corresponding magnitude. 
If the function is defined for all points within a given region, it is there 
said to constitute a scalar field. Potential functions, such as the scalar 
potential in an electric field or the thermodynamic potential of an ideal 
gas, are common examples. The geographical altitude as a function of 
latitude and longitude is a two-dimensional example of a scalar field. 

A vector is a function which is characterized at each point in space by 
means of a magnitude and a direction. If the function is defined for all 
points within a given region, it is there said to constitute a vector field. 
The earth’s gravitational field of force or the velocity field of a fluid are 
familiar examples. The magnitude of a vector function is a scalar. The 
vector function may, therefore, be thought of as a scalar to which a 
direction is assigned at each point in space. 

More specifically, however, two kinds of vector functions are dis- 
tinguished according to their processes of derivation. Thus, for example, 
the gradient of a scalar potential function is a vector. A simple example is 
the gradient in a mountainous terrain. A vector of a physically different 
nature is that used to represent a mechanical torque. The torque is pro- 
duced by a force acting upon a lever arm, and the resulting vector (by 
convention) stands normal to the plane determined by the force and the 
arm; that is, it coincides with the axis of rotation. The direction of the 
torque vector, moreover, must be defined in accordance with a right- or a 
left-hand screw rule. 

These two types of vectors, such as a gradient and a torque, are dis- 
tinguished respectively by the adjectives polar and axial. This distinction 
is not merely a superficial one which may be disposed of by the simple 
process of propounding a pair of suitable adjectives. One reason for 
making such a distinction is brought to light when the variables of these 
vector functions are subjected to a co-ordinate transformation, such as 
changing from a right-hand to a left-hand system of rectangular axes 
(see Fig. 1). In this case, the algebraic sign of the axial vector function is 
reversed and that of the polar vector is not. If both types of vector func- 
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tions are involved in a given problem, this circumstance must be care- 
fully considered. 

An axial vector may be the result of a vector product formed from two 
given vectors. It must be observed, however, that this is the case only if 
both the given vectors are either axial or polar. The vector product 
formed from a polar and an axial vector is polar. 

Since a scalar may be the result of a scalar product of two vectors, it 
appears that this question regarding the distinction between two types 


2 x 
y 
e ° 7 
a bi 
right-hand cartesian axes left-hand cartesian axes 


Fic. 1. Two systems of cartesian axes. 


of vectors is not confined to vector functions. Thus a scalar function 
which results from the scalar product of a polar and an axial vector has 
different mathematical properties from those of a scalar function which 
is the scalar product of two polar or two axial vectors. The first of these 
functions reverses its algebraic sign when subjected to a transformation 
from a right- to a left-hand co-ordinate system; the second does not. The 
latter is invariant to any co-ordinate transformation, as a true scalar 
should be. An energy function is a scalar of this type. The other kind of 
scalar function, which is also encountered in physical problems, is called 
a pseudoscalar, since it has all the properties of a scalar except that it is 
not invariant to certain types of co-ordinate transformations. 

The product of a scalar and a vector yields a vector of the same type. | 
Multiplication with a pseudoscalar, however, changes an axial vector 
into a polar one, and vice versa. It should also be observed that the addi- 
tion or subtraction of vectors or scalars should not be carried out without 
regard to their type or origin. 

In order that the geometrical visualization of it may be facilitated, a 
field is commonly pictured as associated with a system of so-called flow 
lines. In hydrodynamics, for example, these are the paths which are 
traversed by the component particles of the fluid. Since the velocity 
vector is tangent-to these trajectories at every point within the flow 
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region, certain physical characteristics of the vector field itself can be. 
recognized from such a system of flow lines (flow map).* 

The magnitude of the function representing the vector field at any 
point is given by the density of flow lines in the surface normal to the 
vector at that point. The number of lines chosen to represent unit density 
is, of course, arbitrary, but it is significant to observe that since the 
magnitude is a continuous function (except at certain points or surfaces 
where lines begin or end), all points within the field must be thought of 
as occupied by lines. 

A given bunch of these lines, moreover, cannot intertwine with each 
other, because their continuous distribution would then necessitate a 
crossing of lines at some points, and this is impossible since their direc- 
tions must everywhere be unique. The lines defining the longitudinal 
surface of a given bunch form what is known as a tube. Unless such a 
tube contains regions from which lines emanate or upon which they 
terminate, the total number of lines enclosed by it must evidently 
remain constant throughout the flow region defined by the tube. 

From this more physical point of view, vector fields are distinguished 
according to either of two characteristically different (yet in a sense 
complementary) properties which they may separately or simultaneously 
possess. Thus if the flow map exhibits lines which close upon themselves 
(are endless), the field is said to exhibit turbulent characteristics. If none - 
of the flow lines close upon themselves, the field is said to be nonturbu- 
lent.+ In connection with the latter statement it must be recognized, of 
course, that a flow map for a finite region may exhibit no closed paths, 
yet the greater field may be turbulent, for some of the paths may close 
outside the finite mapped region. 

A vector field which is solely turbulent Galiapaatively called rotational 
or solenoidalt) is associated with a flow map containing closed paths 
only. Figure 2 shows an example of such a field. A nonturbulent field 
(also called an irrotational or a potential field) is associated with a flow 
map in which all the lines begin at a source and end upon a simk (or nega- 
tive source). For this reason, the potential field is sometimes referred to 
as a source field, and the turbulent one as a source-free field. Figure 
3 illustrates an irrotational field. 

In the sense that a source is the cause or origin of any field, the turbu- 
lent field must, of course, also have its sources. These, however, are 

*These matters are discussed in greater detail in Electric Circuits, pp. 23-71. 

{The term lamellar is sometimes used to describe such a field. 

{A solenoid is a channel or tube. The term “ solenoidal ” does not appear to be particularly 
appropriate because tubes of flow lines can also be mapped in a potential field. The important 
characteristic of the rotational field is the fact that its tubes close upon themselves, each 
forming an endless conduit containing the same number of flow lines throughout the length 
of its circuit. 
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referred to as vortexes. They are the whirlpools or eddies in which the 
field has its seat or origin. A turbulent field is sometimes also called a 
voriex field. 

An arbitrary vector field can exhibit both turbulent and nonturbulent 
characteristics, caused by the simultaneous presence of sources and 
vortexes. The turbulent and nonturbulent components of this field are, 
however, linearly independent. In other words, an arbitrary vector field 


rotational field 


vortex, vortex 


Fic. 2. A vector field which is solely turbulent. 


may always be represented as the linear superposition of two independent 
components, one of which is a purely turbulent and the other a purely 
potential” field. 

These matters, together with a number of useful vector operations and 
their interrelations as well as the geometrical interpretations of them, 
are discussed in the following articles. One of the various operations 
encountered here is the linear transformation discussed in Ch. III. The 
coefficients of this transformation (elements of its matrix) may be 
functions of the space co-ordinates, so that for each point in space, a 
particular vector transform is associated with any given vector. This 


*This rather common designation for a nonturbulent field is appropriate because a vector 
function representing the gradient of a scalar potential is inherently nonturbulent, as is shown 
in detail subzequently. 
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transformation function is called a fensor* of valence 2. The order of the 
tensor is that of its matrix. For ordinary space, therefore, the tensor is 
of the third order. The coefficients of the transformation are referred to 
as the components of the tensor. 

A tensor of higher valence is a function which, at every point in space, 
associates a tensor of the next lower valence with any given vector. A | 


mk SF 


Fic. 3. A nonturbulent field. 


tensor of order 7 and valence v has n” components. For example, a tensor 

of the third order (for ordinary space) and valence 3, has 3° or 27 com- 

ponents. Its matrix may be regarded as a three-ditnengional array. In 

fe classification, a vector is sometimes referred to as a tensor of valence 
1,t and a scalar as a tensor of valence 0. 


*The name éensor originated when this kind of function was Brat used i in connection with 
problems dealing with stresses in elastic media. ; 

{In this interpretation of a vector, its components are regarded as defining a row matrix, 
and the linear transformation (tensor) corresponding to this matrix is a single linear equation 
like one of the equations of the set 3, Ch. III. This transformation is seen to transform a given 
vector x into a scalar (single component), but the classification of a vector as a tensor of 
valence 1 nevertheless appears to be inaccurate because it confuses the vector with a trans- 
formation. It would be more proper to say that the components of a vector (not the vector 
itself) may be regarded as those of a tensor of valence 1. 
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Since the analysis of a physical problem usually requires a co-ordinate 
system of some kind, the various vector operations discussed in this 
chapter are expressed not only in vector form but also in terms of an 
assumed system of co-ordinates. In most of the detailed formulations, 
the rectangular Cartesian system, being the simplest, is chosen. Unless 
‘mention is made to the contrary, a right-hand system of axes is assumed. 
The latter is so named because a right-hand screw, turning in the direc- 
tion of the shortest route from the positive x-axis to the positive y-axis, 
advances in the direction of the positive z-axis. Transformations of the 
important vector operations to some of the orthogonal curvilinear co- 
ordinate systems more frequently encountered in practical problems are 
given in a subsequent article. 


2. THE SCALAR PRODUCT 


The scalar product of two vectors A and B (also called the inner 
product) is defined as the product of their magnitudes multiplied by the 
cosine of the angle between them. The scalar product is denoted by a 
dot placed between the symbols A and B; hence 


A-B=|A| [B| cose [1] 


in which 6 is. the angle included between the two vectors.* 

According to this definition, the scalar product may alternatively be 
regarded as the product of the length of either vector with the projection 
of the other upon it. If one of the vectors, for example, B, is given by the 
vector sum (according to the parallelogram law of addition) of two other 
vectors, as in the expression 


B=C+D [2] 
then 
A-B=A-(C+D)=A-C+A-D [3] 


This result is seen to be true because the projection of B upon A is evi- 
dently equal to the sum of the projections of C and D upon A. Hence the 
distributive law holds for scalar products. 

When it is necessary to express the scalar product in terms of the 
components of the vectors A and B with reference to a rectangular 
Cartesian co-ordinate system, it is convenient to define a set of unit 
vectors having the directions of the x-, y-, and z-axes. These unit vectors 


*According to this notation, which is attributed to Gibbs, the scalar product is sometimes 
also referred to as the dot product. An alternative notation quite frequently found in the 
literature is to indicate the scalar product by enclosing the symbols for the two vectors in 
parentheses, thus: A- B = (AB). 
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are denoted respectively by the letters z, 7, and &. The components of the 
vectors — that is, their projections upon the x-, y-, and z-axes — are de- 
noted respectively by Az, A,, Az, and B,, B,, Bz. 

The vectors themselves may then be written 


A =1d,+jA, + RA, [4] 
and 
B=iB,+jB,+ kB, 


The terms in these equations are vector components, and the right-hand 
sides represent vector sums. The scalar product of A and B may now be 
written 


A-B= (iA, +jAy + kAz)- (Be +jBy + kB,) [5] 


Since the distributive law holds, the right-hand side of Eq. 5 may 
be replaced by the sum of nine component scalar products such as 
(iA,) - (¢Bz), (Az) - (7B,), etc. The unit vectors z, 7, k are mutually at 
right angles to each other. Hence the scalar product of any one of these 
with any other one is zero, and the scalar product of any one with itself 
is unity; that is 

t-juj-k=k-i=0 , 
cane [6] 
tet=fg=k-k=1 
Consequently, only three of the nine component scalar products resulting 
from Eq. 5 have nonzero values, so that 


A-B= A,B, + AyB, + A,B; 17] 


The scalar product is thus given by the sum of the products of the corre- 
sponding components of the two vectors. This result is true only for a 
rectangular co-ordinate system (discussed in Art. 4, Ch. ITI) and no 
longer holds when the co-ordinate axes make oblique angles with each 
other. 

It is clear from the definition of the scalar product that the commutative 
law holds; that is, 


A:-B=B-A [8] 
Since the scalar product of two vectors yields a scalar, a triple scalar 


product such as might be denoted by A - B-C has no meaning, and the 
question whether the associative law holds hence does not arise. 


3. THE, VECTOR PRODUCT 


The vector product of two vectors A and B (also called the outer 
product) is defined as a vector whose magnitude is given by the product 
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of the magnitudes of A and B, multiplied by the sine of the angle between 
them. The direction of the vector product is related to the directions of 
A and B by the right-hand screw rule in the sense that a right-hand screw, 
turning in the direction of the shortest route from the tip of A to the tip 
of B (assuming that these emanate from a common point*), advances in 
the direction of the vector product. 

In connection with this definition, it is significant to observe that the 
angle @ between A and B, which enters into the determination of the 
magnitude of the vector product, is that angle through which the right- 
hand screw defining the direction of the resulting vector is turned in 
passing from the tip of A to the tip of B. This is usually taken to be the 
smaller of the two supplemental angles through which it is possible to 
turn in passing from one to the other of any two coterminous vectors A 
and B. The result, however, is the same if the larger of these two angles 
is chosen, because the direction is then reversed and the algebraic sign of 
the magnitude is reversed also. 

In the Gibbs notation, the vector product is denoted by a cross, placed 
between the symbols for the two vectors.} Thus the magnitude of the 
vector product is expressed by 

|A x B| = |A| |BI sine [9] 


This expression is recognized geometrically as being numerically equal to 
the area of the parallelogram defined by the two coterminous vectors A 
and B. 

If the cosines of the angles between the normal to the surface of this 
parallelogram (in the direction of the vector A x B) and the x-, y-, and 
g-axes of a rectangular co-ordinate system are denoted respectively by 
cos (2,x), cos (n,y), and cos (7,2), then 


V, = |A xB cos (1,x) 
V, = |A « Bl cos (ny) [10] 
V, = |A «Bl cos (1,2) 


represent the projections of the area of the parallelogram upon the yz, 
zx, and xy planes respectively. Since 


cos” (n,2) + cos? (n,y) + cos? (mz) = 1 [11] 


*In other words, the vectors are for the moment assumed to be coterminous, as they 
usually are. The definition of the vector product (and the scalar product also) is, however, 
independent of whether the vectors are coterminous. If they are not, then the direction of 
the vector product may perhaps be more easily visualized by first displacing one of the vectors 
parallel to itself until the two become coterminous. 

{For this reason, the vector product is sometimes also referred to as the cross-product. 
An alternative method for indicating the vector product is to enclose the symbols for the two 
given vectors in square brackets, thus: A x B = [AB]. : 
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it follows that 
|Ax« Bl =VV,2+V,7+ V2 [12] 


and hence V,, Vy, and V, are seen to be the components of the vector 
product 


V=AxB [13] 


In other words, the component areas given by Eqs. 10 are recognized 
as having the properties of vector components, and their vector resultant 
is identified with the vector product as defined above. 

Observe in this connection that a component area (projection of the 
surface of the parallelogram determined by the vectors A and B upon one 
of the three co-ordinate planes), as defined by one of the three Eqs. 10, 
is not merely a geometrical projection, but in addition involves an alge- 
braic sign which reverses if the direction of the normal is reversed (re- 
placing the angles of the cosine functions by their supplements). That is, 
the algebraic signs of the components 10 are controlled by the right-hand 
screw rule for the vector product. 

If the projections of the vectors A and B upon the ye, 2x, and xy planes 
are denoted by A“, B®), and so forth, it follows from these considera 
tions that the vector components of V are given by 


iV, = AW® x BY® 
jVy = AG x BO? [14] 
kV, = AG” x BOY” 


The magnitude of one of these components, such as V,, for example, is 
given by 
V, = |Aw)| [Be| sin Oye ; [15] 


in which @,, is the angle included between A“? and BY). Replacing this 
angle by the difference between the angles which these vectors separately 
make with the y-axis, applying the trigonometric identity for the sine of 
the difference between two angles, and noting that the components of 
A» and B® are those of A and B on the Y and Z axes, it is found that 


V, = A,B, — AzBy [16] 
and similarly that 

V, = A.B, — AzB, [17] 
and 

V, = A,B, — A,B, [18] 


These are the components of the vector product expressed in terms of 
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those of the two given vectors A and B. If the vector B is the resultant 
of two other vectors, that is, if 


B=C+D [19] 
the decomposition of C and D into their rectangular components, and 
substitution into Eqs. 16, 17, and 18 show that 

V=AxB=Ax(C+D)=AxC+AxD [20] 


Hence it follows that the distributive law holds with regard to the vector 
product. 

Conversely, if the distributive law is assumed to hold, the vector 
product 


AxB= (iA; +jA, + kAz)* (iB, + jBy + kBz) [21] 
may be replaced by the sum of nine component vector products. Ac- 
cording to the definition of the vector product, 

txt =jrxj=kxk =0 [22] 
and 
4 x7 =k= —j xg 
jrh=i= bef [23] 
kxi = j = —ixk , 
so that three of the nine terms represented by Eq. 21 become zero, and 
the remaining six yield 
A x B=i(A,B,—A,By)+j(4.B1—-AzB.) +k(AzBy—A,Bz) [24] 
This result is seen to agree with that stated by Eqs. 16, 17, and 18. 


It is useful to recognize that the vector product may be written in the 
following determinant form: 


t j ek 
A, A, A, 
B, B, Bz 
The Laplace expansion of this determinant in terms of the elements of 
its first row yields the vector product in the form given by Eq. 24. 


Although the distributive law holds for the vector product, the com- 
mutative law evidently does not, since 


AxB=—BxA [26] 


AxB= [25] 


The triple vector product 
AxBx«C [27] 


has a unique meaning only when the order in which the products are to 
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be carried out is indicated. Thus in the association indicated by 
Ax[BxC] [28] 


the product Bx C is formed first, and then the vector product of A with 
this vector is determined. The result is normal to Bx C and hence is a 
vector which lies in the plane determined by the vectors B and C. In the 
association indicated by 


[A xB] «C [29] 


the product [A x B] is formed first, and then the product of this vector 
with the vector C is determined. The result in this case must be normal 
to 4 xB, and hence is a vector which lies in the plane determined by 
the vectors A and B. 
Evidently 
A x[BxC] # [A « B]«C [30] 


_ so that the associative law does not hold for multiple vector products. 

Since the resultant vector for the triple product 28 lies in the plane of 
the vectors B and C, it must be possible to caprese this vector as a linear 
combination of B and C, that is, 


A x[BxC] = BB + yC [31] 


An evaluation according to the form given by Eq. 24 for the vector 
product shows that 


Ax[BxC] = (A-C)B— (A-B)C [32] 
which agrees with Eq. 31, in which 
B=(A4-C) and y=-—(A-B) [33] 
The triple product 29, on the other hand, is 
[Ax B]x«C = —Cx[A~« B] = Cx«[Bx A] [34] 


This has the form of the triple product in Eq. 32 with A and C inter- 
changed. Hence 


[Ax B]«C = (A-C)B — (B-C)A [35] 
which is a vector lying in the plane determined by the vectors A and B, 
as stated above. 
4. THE SCALAR TRIPLE PRODUCT 


The following combination of a vector and a scalar product 
A-[Bx«C] [36] 


in which A, B, and C are arbitrary vectors, is called a scalar triple product. 
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The definition of the scalar product in the form given by Eq. 7, together 
with the determinant form 25 for the vector product, shows that the 
scalar triple product may be expressed as the value of the determinant 


A, A, Az 
A-[BxC] =| B, B, B; (37] 
Ce C...“C, 


The result is, of course, a scalar. 

Since the transpose of a determinant has the same value,* the rows in 
the determinant 37 may be written alternatively as columns. The value 
reverses its algebraic sign when any pair of rows are interchanged, but it 
remains unchanged if this interchanging is done twice in succession. Since 
the cyclic order of the letters 4, B, C can be changed to B, C, A by two 
interchanges, and to C, A, B by two more interchanges, it follows that 


A-[BxC] = B-[CxA] =C-[A«B] 38] 
A 
uPxe oe 
triple product 3 a va 
A- (Bx) af yd 


hy= projection of vector A upon BxC 
{BxC\=area of parallelogram OBPC 


Fic. 4, A geometrical interpretation of a scalar triple product. 


A geometrical interpretation for the scalar triple product is readily 
given as shown in Fig. 4. The magnitude of [B « C] equals the area of the 
parallelogram determined by B and C; its direction is normal to the plane 
of this parallelogram. If the scalar product is interpreted as the length 
of the vector [BxC] multiplied by the projection of A upon it, the scalar 
triple product is seen to be equal to the product of the area of the 
parallelogram determined by B and C, multiplied by the component of A 
normal to this surface. The result evidently represents the volume of the 
parallelepiped of which three coterminous edges coincide in length and 


*See Art. 2, Ch. I. Specifically, this is there stated as the property VIII. 
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direction with the three vectors A, B, and C. The parallelogram deter- 
mined by B and C is regarded as the base of this parallelepiped, and the 
normal component of A is its altitude. In the equivalent forms given by 
Eq. 38, the base of the parallelepiped is alternatively regarded as defined 
by the vectors C and A, or A and B. These alternate interpretations are 
illustrated in Figs. 5 and 6. Each of the three expressions in Eq. 38 
represents the volume of the same parallelepiped. 


Oikos 
triple product ve . Jewctaeee 
B+(CxA) wv} triple product At a 
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hg = projection of vector B upon Cx A 
1C'x Al= area of parallelogram OCQA 


Fic. 5. Alternate interpretation of 
a scalar triple product. 


hy= projection of vector C upon Ax B 


\AX Bi=area of parallelogram OBSA 


Fic. 6. Alternate interpretation 
of a scalar triple product. 


In connection with this scalar triple product, observe that the brackets 
enclosing the vector product of B and C in the expression 36 are signifi- 
cant because they indicate the order in which the operations are to be 
carried out. Thus, if this triple product were written A - BC, it might 
be thought that A - B could be carried out first if desired. This procedure, 
however, yields a scalar, and the subsequent vector product with C then 
has no meaning. 


5. THE GRADIENT 


A single-valued scalar function of the space co-ordinates x, y, z is 
denoted by the symbol U. It is a function of position or location only. 
The points in space at which U has a given value, for example, C, define a 
surface which is referred to as a constant-value surface. Any number of 
such surfaces, for various assumed values of the constant C, may be 
mapped. In particular, it is expedient to map a series of constant-value 
surfaces for values of C which differ by integer multiples of some chosen 
interval. A familiar example of such a map is that for the two-dimensional 
altitude function in a geographical terrain. Here the function U is constant 
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along lines instead of surfaces. These are called contour lines, and the 
resulting plot is spoken of as a contour map. 

Such a map places the variation of the function U in evidence, since 
the function evidently changes slowly in those regions where the lines or 
surfaces are far apart, and rapidly where they are closely spaced. The 
rate at which U varies in any given direction at a point in space is 
determined approximately by the ratio which the interval chosen for the 
constant C-values has to the distance measured between two neighboring 
surfaces in the given direction at the point in question. If the interval in 
the C-values at this point is allowed to become smaller and smaller, the 
corresponding limiting value of the ratio accurately yields the desired 
rate of change of U. This value is called the directional derivative of U. 

It is apparent that the directional derivative of U is a maximum at'a 
given point if the derivative is taken in a direction normal to the constant- 
value surface passing through that point, because the distance between 
neighboring surfaces is evidently smallest in the normal direction. This 
maximum value of the directional derivative is called the normal deriva- 
tive of U. 

As a function of the space co-ordinates, the normal derivative of U 
appears to have the properties of a vector function. The truth of this 
statement may be seen from the fact that if dw is the differential distance 
in the direction of the normal between two neighboring constant-value 
surfaces for which C differs by dC, and if ds is the distance between these 
surfaces in any other direction s, then, except for differentials of higher 
order, 


dn = ds cos 6 [39] 


where 6 is the angle between the normal and the direction s. It follows that 
Ce ee ee [40] 


in which 9U/dn is the normal derivative of U. 

This result shows that if the normal derivative is regarded. as a vector 
pointing in the direction of the normal, the derivative of U in any direction 
sis given by the projection of this vector upon a line having that direction. 
The normal derivative, therefore, has true vector character. This vector 
is called the gradient of U at the point at which the normal derivative is 
evaluated. 

The gradient is defined as pointing in that direction in which VU in- 
’ creases and it is evidently a function of the space co-ordinates, since its 
magnitude and direction depend upon the point at which 9U/an is 
evaluated. For a geographical altitude function, the gradient at any 
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point indicates the direction of steepest ascent, and its magnitude equals 
the maximum rate of change of altitude with distance at that point. 

In equations, the gradient is written in the abbreviated form: grad U. 
If m denotes distance measured along the normal at any point in the 
direction in which U increases, and m; represents a unit vector in this 
direction, the gradient is expressed by the vector equation 


grad U = ny ae [41] 


If s; denotes a unit vector in any direction s, the component of the 
gradient in that direction is given, with the help of the scalar product 
and Eq. 40, by 

ou aU aU 


grad, U = s - grad U = 1+ 2 = 3 cos = = [42] 


In a rectangular co-ordinate system, the components of the gradient are 


=. oU 
grad, U = ae grad, U = ay grad, U = ~ [43] 


grad U =i 5 +) tee [44] 


A more compact form for this expression is obtained by defining the 
so-called Hamiltonian operator* 


.9 , 9 é 
WF gl ay ae [45] 
The equivalent of Eq. 44 then reads 
grad U = VU [46] 


The operator V may in some respects be formally treated. as a vector 
with the components 0/dx, 8/dy, 0/dz. These, however, cannot in general 
be manipulated as though they were ordinary algebraic coefficients. 
They are differential parameters (or operators) of the first order, and 
hence perform the operation of differentiation upon whatever function 
follows them. 

For example, if W is another scalar function, the rule for differentiating 


*Since the symbol for this operator is an inverted Greek capital delta, it is frequently 
referred.to by the name “ del,” and grad U is alternatively called “ del of U.” Another name 
for the operator V is “nabla,” after the Greek name for a harp, which this symbol re- . 
sembles in form. 


198 VECTOR ANALYSIS [Ch. V 


a product shows that 
V(UW) = UVW + WVU [47] 


or 
grad (UW) = U grad W + W grad U [48] 


Another example is the scalar triple product 36 in which the operator V 
takes the place of the vector A. Here the relation 38 is not applicable 
without due attention to the implied operation of differentiation in the 
multiplication of V with the functions B and C. The correct evaluation of 
the expression V - [BC] is given in a subsequent article (see Art. 16) 
after the operations V- A and Vx A have been discussed. The present 
remarks are made merely to caution the reader against any careless ma- 
nipulation of the vector operator V. _ 

An important property of the gradient comes to light from a con- 
sideration of the so-called line integral of 
this vector function evaluated for an arbi- 
trary path extending between any two 
points in space. Thus if 


f = grad U [49] 
then the integral 


[fr ds [50] 


is referred to as the line integral of the 
vector function f between the points 
a(%1,91,21) and b(xX2,¥e,22). These two 
points are assumed to be connected by acon- 
Fic. 7. A scalar product in- tinuous path or curve S of arbitrary form. 
volving the differential vector Distance along this curve is denoted by the 

sa symbol s. The differential vector distance 
ds is at any point tangential to the path in the direction of continuous 
progress along it from a to 6, and in magnitude equals the scalar increment 
of length ds. The scalar product f- ds then equals the component of f 
coincident with the direction of travel along the path at any intermediate 
point, multiplied by the corresponding path increment. This relationship 
is illustrated in Fig. 7. 

If f represents the force on some particle which is constrained to follow 
this path (like a bead on a bent wire), in the absence of friction the line 
integral 50 evidently yields the total work done by the force as the particle 
travels along the path from a to 6. 


6 
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According to Eqs. 40 and 41, it is recognized that 


f-ds=7,- ass = “as = av [51] 


in which dU is the differential increment of work done by the force f in 
moving the particle through the path increment ds.* Since U is a function 
of position only, dU is a total differential. Hence 


b ” 
fr ds = f grad U- ds = f dU = U(%2,y0,22) — U(a1,91,21) [52] 


From this result it is concluded that the line integral of the gradient 
between any two points is independent of the path joining these points. 
Hence if the particle is subsequently returned to the point @ along any 
other path from bd to a, the total work done by the force f as the particle 
traverses the closed circuit from @ to 6 and back to a, is zero. 

Symbolically, the line integral extending over a closed circuit is 
indicated by a circle placed upon the integral sign. Thus the vector func- 
tion defined as the gradient has the property that 


f grad U- ds = 0 [53] 


It is important to observe that the truth of the results expressed by 

Eqs. 52 and 53 depends upon the single valuedness of the function U. 
This restriction on the function U, which is stated in the opening para- 
graph of the present article, is not always met by functions dealt with in 
practical problems. Further considerations, necessary when J is multi- 
valued, are given in Art. 14 of this chapter. 
' In the above argument, the force f is regarded as due to some rer 
agency which is causing the motion of the particle along a given path. 
This external or driving force must, of course, be balanced by an equal 
but oppositely directed force of reaction. The latter is due to the inherent 
properties of the medium or system through which the particle is moved. 
If F denotes this force of reaction, evidently 


F = —grad U [54] 
In this connection, U is spoken of as the potential function of the sys- 


tem, and F is the vector field of force associated with U. If Up denotes 
the value of U at some chosen datum point, then U — hs represents the 


0U 0U 
*One may alternatively verify this conclusion by writing f = i< +j— By +h a 
and ds = idx + j dy + k dz, whence, according to Eq. 7, 


ov 
fds =O ant Fay +S de = dU 
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work which must be done upon the particle to move it from the datum to 
the point to which U refers. This work is called the potential energy of the 
system consisting of the given particle and the medium in which it is 
embedded. ; 

A simple example is the potential energy of a particle (usually con- 
sidered to have unit mass) located at some altitude above sea level. 
The function F is then referred to as the earth’s gravitational field 
of force. Here the significance of the relation 53 is readily visualized. 
Thus, whatever work may be done by the external force f while the 
particle traverses part of the closed path is returned to the external agency 
during the traversal of the remainder of the circuit. This would be the 
case, for example, if one were to carry some object around a closed path 
on the side of a mountain. 

The work which the external agency may contribute during the 
traversal of a certain portion of the path is thought of as stored by the 
force field through which the particle is moved, and the significance of 
Eq. 53 is that such stored energy is not lost but may be completely 
regained. A force field which has this property of conserving whatever 
energy increment may be imparted to it is called a conservative field. This 
is the property of a vector field defined by the gradient function. 

Such a vector field is evidently irrotational in character, for if the flow 
map for this field were assumed to contain any lines which close upon 
themselves, one of these could be chosen as the path for the integral in 
Eq. 53, and the value of this integral would then certainly not be zero. 
Hence the gradient always defines a purely potential field, all the lines of 
which must emanate from sources and terminate upon sinks. A conserva- | 
tive field is always irrotational. 

Conversely, if for a given vector field F it is known that 


PF -ds = 0 [55] 


it must be possible to define a potential function U such that F is given by 
Eq. 54. 


6. THE DIVERGENCE 


The sources of a potential field are sometimes thought of as con- 
centrated at points or distributed along filaments or over surfaces. 
Whereas such source distributions may, in the discussion of certain 
physical problems, be convenient from an analytic point of view, they 
_nevertheless are idealizations which require a proper mathematical 
interpretation. For the present discussion, the sources of a potential 
field are considered to be continuously distributed throughout space, 
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According to the hydrodynamic analogy, a region in which sources are 
located is one from which fluid emanates in a continuously distributed 
fashion, like oil seeping up through the pores of a bed of quicksand. (The 
pores as well as the rates of flow through them are to be thought of as 
being infinitesimal.) The object of the present discussion is to formulate 
some means for describing the fluid productivity of an infinitesimal 
element of space in a source-filled region. In other words, some measure 
of source density or intensity is needed in order that a given distribution | 
of sources may be described and the relation of this distribution to the 
associated field intensity may be determined. 

The net rate at which fluid emanates from a small but finite productive 
region may be measured through integrating the rate of flow over a 
surface enclosing this region. If this net rate is divided by the enclosed 
volume, the resulting figure represents an average rate of productivity 
per unit volume for this region. The actual rate of productivity per unit 
volume may, of course, vary from point to point throughout the region. 
At any given point, it is expressible as the limit of the average rate 
obtained through shrinking the enclosure about that point until the 
contained volume bécomes infinitesimal. 

This limiting value is a convenient measure of the intensity of the 
source region at any point, and it is referred to as the divergence of the 
flow field at that point. Although the flow field is a vector function, its 
divergence is clearly a scalar. 

If A is any vector function, then in accordance with the discussion 
just given, its divergence (abbreviated div A) may be mathematically 
defined by 


§ A-da 


div A = limit { —teedsurface_ [56] 
; Sdo—0 f dv 


enclosed volume 


The circle on the integral sign in the numerator indicates that the inte- 
gration extends over a closed surface. In this integral, da is a vector surface 
increment. Its magnitude equals the scalar differential area da at any 
point on the surface, and its direction is that of the outwardly directed 
normal at that point. The scalar product A - da, therefore, represents the 
product of the normal component of A and the scalar differential area da 
at the same point. This equals the rate at which fluid passes through the 
surface element da, if A is thought of as representing the velocity field’ 
of a fluid. ib 

The integral in the numerator of Eq. 56 is then seen to equal the total 
rate at which fluid passes outward through the closed surface. The integral 


: 
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in the denominator of this expression represents the enclosed volume. 
The resulting limit of the ratio of these two integrals yields the divergence 
of A at the point about which the closed surface is shrunk by the limiting 
process. Its value is in general different for different points in space. 

In order for div A to be calculated when the vector function A is 
given, Eq. 56 must be further evaluated. For this purpose, the enclosed 


Y 


Fic. 8. Enclosed volume assumed in the calculation of div A. 


volume is effectively assumed in the form of a rectangular parallelepiped 
with its center located at the origin of a rectangular co-ordinate system, 
and three of its coterminous edges coincident in direction with the co- 
ordinate axes as shown in Fig. 8. If the sides of this parallelepiped are 
identified with the differentials dx, dy, dz, the desired result is at once 
_ obtained without the necessity of subsequently carrying out the limiting 
process. 

The vector function A is assumed to be finite and continuous in the 
vicinity of the parallelepiped, so that, except for differentials of the 
second order, the variation of A is linear throughout this infinitesimal 
region. If A,, Ay, Az denote the values of the components of A at the 
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origin, for the surface of the parallelepiped 


OA, 
f4-da= (40+ as Fay as — (4. - 9 +) ay d 


“Ox. Ox 


aAy dy ( dAy dy 
+ (4, + ay 2 dz dx Ay ay 2 dzdx [57] 
0A, dz 0A, dz 
+ (4. + a Fa dy - (4. eles +) aedy 
which yields 
0A oA 0A 
A sda [oe ; 
f a (S ae A2) ae dy ds [58] 
The enclosed volume is given by 
fa = dv = dx dy dz [59] 
Hence, in rectangular co-ordinates, Eq. 56 evaluates to 
‘ aA, dA, , 0A, 
er ore [60] 


By means of the Hamiltonian operator defined ‘by Eq. 45, and the 
form for the scalar product given by Eq. 7, the result stated in Eq. 60 
may be written 


divA = V-A [61] 


It is significant that the partial derivatives in Eq. 60 are always to be 
evaluated at the same point in space at which the divergence of A is 
desired. If the result is numerically positive, the point in question is a 
source; if it is negative, the point is a sink. If the result is zero over a 
finite region, the latter is source free. On the other hand, if div A is zero 
throughout all space, it may be concluded either that A is zero everywhere 
or that this vector function describes a field which is purely rotational in 
character.* The possibility that A might be constant throughout all 
space must be discarded on the physical ground that any field must 
vanish at infinity (exceptions to this rule are due to idealizations which 
are physically unrealizable). 


7. Gauss’s LAW 
According to the definition of the divergence as expressed by Eq. 56, 
*This alternative is considered in greater detail in Art. 10. 
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it follows that — 
fA-da= div Ado [62] 


closed surface enclosed volume 


Since the integrand in the right-hand integral, namely, div A dv, repre- 
sents the fluid productivity for the volume element dv, whence this 
integral yields the total productivity for the finite region, a result that is 
alternately given through integrating the fluid flow over the enclosing 
surface. 

This result, which is known as Gauss’s law, formally represents the 
transformation of a surface integral into a volume integral, or vice versa. 

It should be observed that the hydrodynamic analogy, used in the 
preceding article to lend concreteness to the definition of the divergence, 
tacitly implies that the “fluid ” be incompressible. An arbitrary vector 
function may ‘be likened to the flow of an incompressible or ideal fluid. 
- In this light, Gauss’s law becomes almost self-evident, since it states 
merely that all the incompressible fluid produced within a given region 
(this is the volume integral of div A) must issue from the enclosing 
surface. 

Gauss’s law is generally applicable to any vector function for which the 
divergence exists, that is, for which the partial derivatives given in Eq. 60 
can be formed. If this condition is not, the vector function A is in general 
regular and continuous throughout the region involved. When these 
conditions are not fulfilled, the desired transformation may in certain 
cases still be achieved by means of special manipulations to which the 
following discussion i is pertinent. 


8. IDEALIZED SOURCE DISTRIBUTIONS 


If the region contains points at which A is infinite or discontinuous, 
the divergence at such points cannot be evaluated by means of the formula 
given in Eq. 60. Such situations occur in practical problems under 
idealized assumptions. For example, it is sometimes convenient to 
assume that a source region is two-dimensional or one-dimensional, or 
. even that it has zero dimensions. These idealized source distributions 
are referred to respectively as a surface distribution, a filamental dis- 
tribution, or as a point source. 

For the continuous distribution of sources considered in the previous 
article, the source density or productivity per unit volume may con- 
veniently be denoted by some symbol such as p, and defined in con- 
formance with its analogy to electric charge density by the equation 


div A =p [63] 
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Equation 62 may then be written 


pA-da= fow [64] 


closed surface enclosed volume 


Here the volume integral on the right represents the total productivity 
of the region enclosed by the surface over which the integral on the left 
extends. Equation 64 evidently holds regardless of how the source 
density p is distributed throughout the enclosed region, and hence it is 
possible to assume (if convenient) that the total productivity is con- 
centrated at one or more points. These are then referred to as point 
sources. 

In certain physical problems, the actual distribution of source density 
approximates this idealization closely enough to justify such an assump- 
tion, and thus a simplification in the resulting mathematical relationships 
for the determination of the field is made available. 

A filamental source density distribution is similarly an idealization 
_ found convenient in certain types of physical problems. It is significant, 
however, that for the. point or filamental types of source distributions 
the concept of the divergence does not apply. External to the points or 
filaments, the divergence is, of course, zero, whereas for points coinciding 
with these idealized sources the divergence as defined above becomes 
infinite. 

For a point source, the total productivity (point charge) may be 
defined by the relation 

e = pdv [65] 


in which the density p is, of course, infinite because e is assumed finite. 
Similarly, for the filamental distribution, a productivity per unit length 
may be defined as 

g = pda [66] 


the cross-section of the filament being denoted by da. Here again, p must 
be considered infinite. 

Actually there can be no infinite density p, but the practical examples 
to which the idealizations expressed by Eqs. 65 and 66 apply are such 
that the geometrical relations are closely approximated when the actual 
finite source region or filament cross-section is replaced by the infini- 
tesimals dv and da respectively. 

In an analogous manner, a surface distribution of sources is defined 
as having a productivity per unit area given by 


o = pds [67] 


in which the differential thickness ds represents the actual small tite 
thickness of a continuous distribution in the form of a layer. 
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For this surface distribution, it is useful to extend the definition of 
the divergence by making use of the relations 63 and 64. A differential 
surface element da of the source layer is assumed to have a thickness ds 
which is vanishingly small in comparison with the surface dimensions of 
da. This is a perfectly admissible assumption which in fact adds to the 
preciseness of the definition 67. The integral on the left of Eq. 64, applied 
to the surface enclosing this element, reduces to the flow outward from 
the two opposite faces of the layer element, so that this equation yields 


o da = (An + Ang) da [68] 


in which A,1 and Ang are the outwardly directed normal components of 
A on the two sides of the layer. By analogy to Eq. 63 it is then possible to 
define a so-called surface divergence per unit area, given by 


div, A =o=Ant+Anz [69] 


9. THE SCALAR POTENTIAL FUNCTION ASSOCIATED WITH A GIVEN 
SOURCE DISTRIBUTION 


If A is a vector function describing a potential field, it is related to its 
sources by Eq. 63, and to a scalar potential function U by an equation 
similar to Eq. 54. Hence 


div A = — div grad U = p [70] 


and it follows that a potential function U may be associated with any 
given source distribution p. 
By means of Eqs. 46 and 61, this relationship may be written 


V-VWU = —p [71] 


and, with the help of Eqs. 44 and 60, the interpretation of this form is 
readily seen to be expressed by 
aU | eu, eu _ 


Ox? 7 ay? ae CO [72] 


Alternatively, the scalar product VV may be interpreted as the resultant 
operator 


3? a? 3? 
* ayt Tow 


V:-V=-5 
ox? ay 


[73] 

which yields Eq. 72 when applied to the potential function U. 
The differential operator of the second order defined by Eq. 73 is 
referred to as the Laplacian operator, and the form 
vu PU PU 


V-VW=—+ 


x2 ay? az? [74] 
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is spoken of as the “Laplacian of U.” For purposes of abbreviation, 
V - V is alternatively denoted by the symbols V? and A, so that Eqs. 71 
or 72 are frequently written either as 


V?U = —p [75] 
or.as 
AU = —p [76] 


the latter notation being the simpler but subject to some objection on the 
ground that the symbol A might, according to its more common inter- 
pretation, be confused with the notation for an increment. 

The significant point of the present discussion is the fact that a given 
source distribution defines an associated scalar potential function by 
means of Eq. 72, and this in turn determines the associated vector func- 
tion A by means of the relation 


A = — grad U [77] 


This view of the relation between the functions A and p may seem to be 
more roundabout than the apparently simple one expressed by Eq. 63, 
but in the solution of many practical problems it proves to be a more 
convenient method of determining A from p, because Eq. 72 involves 
scalar functions only and hence is frequently more easily integrated than 
Eq. 63. Once the function U is obtained, it is usually not difficult to 
determine A from Eq. 77 because doing so involves differentiation only. 

Moreover, there are certain problems in which the source distribution 
p is not known, but, instead, U and its normal derivatives are known 
over boundaries of given geometrical form to which the unknown source 
distribution is restricted. For the solution of these so-called boundary- 
value problems, the attack by means of Eq. 72 is the only possible one. 
At all points not located on the pp p = 0, and this equation reads 

PU aU 
Dp OS: i Se 
VU re awn aye + a 0 [78] 
A function which formally satisfies this differential equation, and in 
addition meets the stated boundary conditions imposed upon JU, con- 
stitutes the desired solution. 

Equation 72 is known as Poisson’s equation, and the corresponding 
homogeneous equation 78 as Laplace’s. The latter equation is of im- 
portance in connection with any problem in which the mapping of flow 
fields i EA essential, whether this is accomplished by ee or by graphical 
means.* 


*An application of this method to the determination of electric circuit parameters is given 
in Electric Circuits, Ch. I, Art. 6a, pp. 16 ff. 
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10. THE CURL OF A TURBULENT* VECTOR FIELD 


In the present article, attention is turned from the potential field to 
what might be called its complement — the turbulent or rotational field. 
Here the flow lines form closed circuits, so that the line integral of the 
vector function A taken around a closed path is in general not zero. 
That is, for a rotational vector function A 


fA-ds #0 [79] 


The seat or origin of a turbulent field lies in its vortexes or whirlpools. 
It is clear that the greater the intensity of the whirlpools in a given 
region, the greater is the value of the integral 79 evaluated for various 
contours within this region. It is, therefore, reasonable to suggest that 
the value.of this integral be used as a basis for defining the vortex density 
at any point. 

If the vector A is thought of as representing the force acting upon a 
particle, the integral 79 represents the work done by this force field as 
the particle is allowed to traverse a closed path. The path may, for the 
moment, be thought of as circular, with a radius 7. The value of the 
integral 79 divided by 27 then represents an average torque with respect 
to a concentric axis normal to the plane of this circular path. This average 
torque evidently varies with the angular orientation of the axis for a 
fixed location of the center of the circular path. There will evidently be 
one orientation for which the average torque is a maximum. As the radius 
is allowed to shrink until it becomes zero, the maximum average torque 
becomes zero also, but the ratio of this torque to the area of the circle is 
found to approach a finite limit when the force A has finite values. It is 
this limiting value of the ratio of the maximum average torque to the 
enclosed area which thus proves to be a useful measure of the turbu- 
lence of the field A at any given point. This value, except for the factor 
2m, is called the curl of A at that point. 

It is not essential for the definition of the curl that the closed path be 
circular. It is essential, however, that this path lie in a plane so that the 
orientation of the normal be clearly defined, for the curl is a vector 
having this same orientation. Thus the curl of A is given in magnitude 


*In some more recent considerations of hydrodynamic fields, the term iurbulent is used to 
designate an entirely random character that specifically does not permit a mathematical 
representation in terms of either the gradient or the curl. Consideration of fields having such 
random character is not included in the present discussion. The term turbulent in this volume 
is used merely as a means of distinguishing the rotational from the irrotational character 
of a field. 
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fads 


by the maximum value of 


limit plane contour [80] : 
JS da—0 f da 
enclosed surface 


as the plane of the contour assumes all possible orientations. In direction, 
the curl of A coincides with the normal to the plane of the contour, 


Z 


Fic. 9. An infinitesimal tetrahedron to illustrate the vector character of the curl. 


pointing in agreement with the advance of a right-hand screw which 
turns so as to correspond to the traversal of the contour in the evaluation 
of the closed line integral. The vector, curl A, is thus uniquely defined in 
direction as well as in magnitude. 

Just as the divergence of a potential field is at any point a measure of 
the density of its sources at that point, so the curl of a turbulent field 
establishes an analogous relationship between the intensity of that field 
and its vortex density at any point. — 

The vector character of the curl may be demonstrated by means of the 
geometrical configuration in Fig. 9, showing an infinitesimal triangular 
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contour ABC whose normal has an arbitrary orientation relative to a 
reference system of co-ordinates. The sides of this triangle together with 
portions of the co-ordinate axes form the six edges of a tetrahedron with 
vertexes O, A, B, C. The sides OBC, OCA, OAB, normal respectively to 
the X-, Y-, and Z-axes, have infinitesimal areas which are denoted by 
da,, da,, and da,. The area of the side ABC opposite the vertex O is 
denoted by da. Since da,, day, da,, are the projections of da upon the 
co-ordinate planes YZ, ZX, and XY respectively, one may write 


1 _ cos (nx) _ cos (m,y) . cos (m,3) [81] 
da da, da, da. 


in which cos (n,x), cos (m,y), cos (7,2) are the direction cosines of the 
normal n. 
It is likewise clear from the geometry of the configuration that 


A-ds= A-ds+ § A-ds+ § A-ds [82] 
ABC OBC OCA OAB 


inasmuch as the net integration represented by the right-hand side of this 
equation involves traversals in both directions along the edges of the 
tetrahedron emanating from the vertex O (see the circulatory arrows in 
Fig. 9). In terms of the notation 


g A-ds 
‘ABC 


RS | [83] 


and 


f Wide ¢ A-ds f Mecas 
OBC = OCA OAB. 


R, = - 
da, da, daz [84] 


the results expressed by Eqs. 81 and 82 show that 
R = R, cos (n,x) + R, cos (n,y) + R, cos (n,z) [85] 
Subject to the condition 
cos? (n,x) + cos? (n,y) + cos? (,z) = 1 [86] 


the expression on the right-hand side of Eq. 85 attains the maximum 
value 


Raz = VR +R? + RF [87] 
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for the particular orientation of the normal given by* 


y 


cos (”,x) = 


cos (,y) = 


cos (#,z) = = [88] 


i a 
Riaz Raz Max 
Since the relations expressed by the last two equations are the familiar 
ones existing between a vector and its rectangular components, the 
desired proof is completed. 

It is clear that the curl is an axial vector similar to that representing a 
mechanical torque, the right-hand screw rule serving in both instances to 
relate the rotational motion to the axial direction of the vector. The com- 
ponents of this vector with respect to a rectangular co-ordinate system 
may be determined by noting that the component of the curl in any 


direction s is given by 
fA - ds 


curl, A = limit | Manecontour normal tos [89] 
S da—0 f ie 
enclosed surface 


in which the direction of s is linked with the direction of traversal in the 
line integral by means of the right-hand screw rule. 

The x-component of the curl is thus found from consideration of a 
closed contour lying in the yz-plane. For convenience, this is taken to be 
the contour of a differential rectangle with its center at the origin, as 


*To maximize f(x,y,z) = ax + by + cz subject to the condition x? + y? + 2? = 1, form: 


r 
f=atbyta— . (a? + y? + 2? — 1). Equating the partial derivatives to zero gives: 


a oe Peet 
Ox r 
of 6 
— =b—r,y=0 = 
oy - came 
FS ameutg iets 
Oz r 


“The condition then yields \ = Va? + 6? + c* so that one has 
fmaz =VOP+FP4+C 
and the particular values of the variables yielding this maximum are 
a b c 


= 4= ,= 
Imax Smaz Smaz 


For other applications of the method of determining conditioned maxima see Arts. 4 and 5, 
Ch. IV. 


x 
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illustrated in Fig. 10. Since the co-ordinate axes form a right-hand system, 
the X-axis points upward from the plane of the paper, and the direction 
of traversal of the rectangular contour is as indicated by the arrows. It is 
clear that only the y- and z-components of A contribute to the x-com- 
ponent of the curl. 

The components of A in the direction of traversal along the sides 1-2, 
2-3, 3-4, and 4-1 may be denoted respectively by Ai2, d23, Aza, and 
Aa. If A, and A, denote the values of the y- and z-components of A at 
the origin, then, except for differential contributions of higher order 
(assuming A to be finite and continuous throughout the region covered 
by the rectangle), 


” 0A, dy 
Ap = A, + oy 2 
dun = —A, — 240 
oz 2 
A, d [90] 
2 dA, dy 
Au = —As ay 2 
0A, dz 
Aue as éz 2 
and hence 
PA-ds = Assadi + Ans dy + Ags de + Aas dy 
_ (aA: 24.) 
= (4 mare dy dz [91] 
According to Eq. 89, the «-component of curl A, therefore, becomes 
0A, OdAy 
curl, A = eae [92] 


In an analogous fashion, or through simply 
advancing the cyclic order of the letters x, y, z, 
the y- and z-components of curl A are found to be 


0A, 7 0A, 


curl, A = [93] 
Fic. 10. A _ differential a: a 
rectangle centered at the 0A 0A 
origin. cul, A => *— % [94] 


Comparing the. turbulent with the potential field again, it is interesting 
to observe the significance of the conditions under which 


$4 -ds=0 [95] 
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According to the discussion in Art. 5, these conditions are stated by 


A = — grad U [96] 
or 
0U aU ou 
A ————_ [i >_> — —— 
© ra Ay ay A, = [97] 
Now since 
eu  39U 
ax ay ay re etc. [98] 
it follows that the conditions 97 may alternatively be stated in the form 
: | dA, dAy _ 
ay as [99] 
dA, OA, _ 
az Ot : [100] 
aA, ade _ 
Ae ay 0 [101] 


The expressions on the left-hand sides of these equations are the com- 
ponents of curl A given in Eqs. 92, 93, and 94. Hence it is seen that 


curl A = 0 [102] 


becomes the necessary and sufficient condition for the vanishing of the 
line integral as expressed by Eq. 95. 

If over a given region the curl of A vanishes at all points, the line 
integral of A for any closed path within this region vanishes also,* and A 
may there be represented as the gradient of a scalar potential function 
as expressed by Eq. 96. Conversely, if the vector function A is the gradient 
of a scalar, its curl must be zero; that is, A must be nonturbulent. The 
gradient then, has no curl. In symbols, 


curl grad U = 0 [103] 


This statement is verified through substituting the relations 97 into the 
Eqs. 99, 100, and 101. 

~ It is useful to observe that the components of curl A given by Eqs. 92, 
93, and 94 may be combined into a single compact vector expression by 
means of the determinant form 


aj ik 

a 98 8 

curl A = Ox ay 2 [104] 
a AAs As 


*This statement is subject only to the restrictions pointed out in Art. 14, which need not 
be considered at the moment. 
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The Laplace development of this determinant for the elements of its 
first row is readily seen to yield 


écurl, A +7 curl, A + & curl, A [105] 


in which the respective components are the expressions 92, 93, and 94. 

By recalling the determinant form 25 for the vector product and the 
definition of the Hamiltonian operator V as given by Eq. 45, it is recog- 
nized that an alternative compact form for curl A reads 


curl A = VxA [106] 


In summary, the three important vector operations, gradient, diver- 
gence, and curl, are seen to be expressible in terms of the Hamiltonian 
operator, thus , 


grad U = VU 
divA =V-A [107] 
curl A = VxA 


11. StoKEs’s LAW 


According to the definition of the curl as expressed by Eq. 80 or in 
component form by Eq. 89, it follows that 


fads = f (curl A)-da [108] 
any closed contour any surface bounded ; 

: by that contour 
This result, which is known as Stokes’s law, formally represents the 
transformation of a closed line integral into a surface integral; or vice 
versa. Stokes’s law is the counterpart of Gauss’s law in the sense that it 
expresses for the purely rotational field what Gauss’s law expresses with 
reference to the potential field. 

It should be observed that the closed contour in Eq. 108 is not restricted 
to lie in a plane, and that the surface bounded by this contour may have 
any shape. For example, the contour may be visualized as a warped hoop 
and the surface as that of a rubber membrane bounded by the hoop but 
allowed to be stretched out sideways into any form whatever. If the 
hoop is thought of as being moderate in size, one may consider blowing 
the rubber membrane out sideways until it becomes inflated like a 
balloon, which may bulge backward over the ring, etc. 

The vector surface increment da, which is normal to the surface, points 
in that direction which is determined from the direction of traversal of 
the boundary by the right-hand screw rule. This correlation is shown in 
Fig. 11, In Fig. 12 the surface is for convenience drawn as a plane. The 
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scalar product (curl A) - da equals the line integral of A for the closed 
boundary of any one of the surface elements. The surface integral repre- 
sents the sum of all these elemental closed line integrals. This sum 


direction of traversal 
Fic. 11. Correlation of contour traversal with Fie. 12. The closed line 
element traversal when bounded surface does not integral equals the sum of 
lie in a plane. the elemental line integrals. 


evidently equals the line integral around the large boundary because the 
common boundaries of the surface elements are traversed in both direc- 
tions and their contribution to the surface integral is zero. Thus the 
validity of Stokes’s law, as stated by Eq. 108, is established. 


12. THE VORTEX DISTRIBUTION OF A TURBULENT FIELD 


For the turbulent field, it is convenient to define a vortex density J by 
means of the relation 


curl A = J [109] 


in which J is analogous to the current density occurring in the study of 
electricity and magnetism.* Stokes’s law, Eq. 108, then yields 


fA -ds 2 fra [110] 


closed contour enclosed surface 


Equations 109 and 110 are analogous to Eqs. 63 and 64. They relate the 
turbulent field to its vortex distribution J just as the potential field is 
related to its source distribution p. The vortex density J is the cause of 
the turbulent field just as the source density p is the cause of a potential 
field. , 

The essential difference is that p is a scalar function of the space co- 
ordinates whereas J is a vector function. In general, J is a finite and 

*In this analogy the vector function A represents the magnetic field intensity and should 


not be confused, with the magnetic vector potential which is customarily denoted by the 
letter A. 
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continuous function; that is, the vortexes are thought of as continuously 
distributed over certain regions just as the source density p is in general 
continuously distributed. In certain practical problems, however, it is 
convenient to idealize the vortex distribution. The situation is again 
analogous to the idealized source distributions discussed in Art. 8, except 
that there is no vortex analogue of the point source.* 

There is, however, a vortex analogue of the filamental source distri- 
bution. It is spoken of as a vortex thread. The amount I of the vortex 
thread is defined by the relation 


I=Jda [111] 


in which da is written in place of the actual finite cross-section of the 
filament. Since J is finite and da differential, this idealization requires 
the concept of an infinite vortex density J. The latter, as well as the 
vector I, is everywhere directed tangentially to the filament. The vortex 
thread may be thought of as a filament along which are concentrated 
whirlpools of infinite intensity, and about which a fluid is set into a 
swirling motion. The central portion of a tornado or “‘ twister,” and that 
of a “ waterspout,” are hydrodynamic examples permitting an idealized 
representation by means of a vortex thread. 

A surface distribution of vortexes may be thought of as the result of 
many threads placed side by side like the warp in a weaver’s loom. The 
vector moment J has the same value and direction for all the threads. 
A surface density g may here be defined by the relation 


ge JSds [112] 


in which ds represents the thickness of the surface or layer. There is no 
whirling action around the individual threads, that is, through the sur- 
face, because of the cancellation of this action along adjacent sides of the 
threads. The vortex surface merely causes a translatory motion of fluid 
in opposite directions along its two sides. Thus the tangential component 
of the vector field A is observed to change suddenly as the point of 
observation is shifted from one side of the vortex surface to the other at 
any given point. 

Figure 13 shows the vortex surface in cross-section, the cut being 
parallel to the tangential components of A and at right angles to the 
direction of g. The line integral in Eq. 110 is evaluated for the closed 
rectangular path with the differential sides ds and df. Here ds is assumed 
to be so small compared to df that the corresponding contributions to 
the line integral are negligible in comparison with those for the sides d?. 


*In this connection it may be observed that one does have a vortex analogue of the double 
point source (the doublet or dipole), namely, the vortex ring. 
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Equation 110 then yields 


in which Aj and Ay are the tangential components of A on the two 
sides of the surface. 

By analogy to Eq. 109, it is then possible to 
define a surface curl per unit length (in the d? di- 
rection), given by 


curl, A =g= Ay = A [114] 


This vector points upward from the surface of the 
paper (the assumed direction of the vortex density 
g) in Fig. 13. If m2 denotes a unit vector normal y 
to the vortex surface pointing from side 1 to side 2, ee ae ee 
the direction of the surface curl is contained in the surface in cross 
right-hand screw rule for the vector product in the section. 
definition 


curl, A = g = (Ag — Aig) * me [115] 


13. THE VECTOR POTENTIAL FUNCTION ASSOCIATED WITH A GIVEN 
VORTEX DISTRIBUTION 


The present article seeks to show that the vortex distribution of a 
turbulent field determines a vector potential function in a manner 
similar, except for certain anti-parallelisms, to that relating a scalar 
potential function to the source distribution of a potential field, as dis- 
cussed in Art. 9. 

The first step in this argument is to observe that if Stokes’s law, Eq. 
108, is applied to a closed surface, the line integral on the left-hand side of 
this equation is zero. This fact may be visualized through first considering 
the surface to be balloon shaped and the closed contour or boundary to 
be the small loop located at the throat through which the balloon is 
inflated. As this loop is contracted until the balloon is finally tied off, the 
closed contour shrinks to zera and the line integral vanishes. 

According to Gauss’s law, Eq. 62, it follows, therefore, that 


f (curl A) da = few (curl A) dv = 0 [116] 
closed surface enclosed volume 


Since the size of the enclosed volume is arbitrary, it may be considered 
to be infinitesimal; whence this result yields 


diy curl A = 0 [117] 
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The conclusion is that the vector field given by the function curl A is 
source-free; in other words, curl A is a purely turbulent vector field. This 
conclusion is true for any vector function A. The relation 117 is an 
identity. 

The curl then, may be said to have no divergence. This statement is the 
complement of the one made in Art. 10 to the effect that the gradient has 
no curl. Whereas the gradient function represents a pure potential field, 
the curl represents a purely turbulent field. 

The identity 117 may readily be checked independently through 
substituting the components of the curl given by Eqs. 92, 93, and 94 for 
the components A,, A,, A, in the expression for the divergence as given 
by Eq. 60. 

If a given vector function B is known to represent a purely turbulent 
field, that is, if 

divB =0 [118] 


the result just obtained permits the function B to be expressed as 
B=curlA [119] 


the thought being that the vector function A is thereby specified in terms 
of B, or vice versa. Here it must be observed, however, that whereas B 
is uniquely determined in terms of A by means of Eq. 119, the reverse 
is not true. 

This operation is readily appreciated from the fact that if the vector 
function A is assumed to be perfectly general, it may represent a field 
which exhibits both potential as well as turbulent characteristics. In 
other words, A may be represented by 


A=T+P [120] 


in which T is the source-free (or turbulent) component of A, and P is its 
curl-free (or potential) component, that is 


div T = 0 [121] 
and 
| curl P = 0° [122] 
Substituting the expression 120 for A into Eq. 119 yields 

B=curlT [123] 


because of the relation 122. 

It thus becomes clear that Eq. 119 alone does not determine the whole 
function A, but only its source-free component. If the vector function A 
is introduced by Eq. 119 merely for convenience, that is, as an auxiliary 
function in the course of an analysis, this difficulty may be overcome by 
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the further demand that 
divA=0 | [124] 


The vector function A is then uniquely characterized in terms of B by 
means of the two relations 119 and 124. 

This situation is similar to having given a purely potential field F, 
and assuming that a scalar potential function U may thereby be defined 
in terms of the relation 

= — grad U [125] 


In this case, U is determined only within an additive arbitrary constant, 
since the gradient of the latter is zero. Just so, Eq. 119 determines A only 
within an additive arbitrary vector function representing a potential 
field, since the curl of the latter is zero. Setting the constant component 
of U equal to some chosen value (for example, zero) is analogous to 
setting the divergence of A equal to zero in the present discussion.* 

It may now be supposed that the given source-free field represented by 
B has a vortex distribution described by the vector function J; that is 


curl B= J [126] 


From Eq. 119 it then follows that a vector function A is related to this 
vortex distribution by means of the equation 


curl curl A = J [127] 


subject to the condition expressed by Eq. 124. 

The function A which is thus related to a given vortex distribution J 
by means of Eqs. 124 and 127 is called the vector potential associated with | 
B, because it plays the same role relative to B and its vortexes that the 
scalar potential U plays relative to the associated nonturbulent field F 
and its sources. If the vector potential A is found from a given vortex 
distribution J by means of Eqs. 124 and 127, the turbulent field B is 
determined from Eq. 119. This method of determining B from J, al- 
though apparently roundabout, is frequently found to be more con- 
venient than that of attempting a solution directly in terms of Eq. 126. 

It is necessary to interpret further the repeated curl operation in Eq. 
127, as is readily done by means of Eqs. 92, 93, and 94 for the components 
of the curl. Thus the x-component of curl curl A is seen to be given by 


8 (dAy _ 242) _@ (4: 2 a) _ 
ay\ ax ay az\ oz ax J 


*Any other disposition of the value of div A, appropriate to the circumstances pertaining 
to a specific problem, may likewise be made. 
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2 
Adding and subtracting from this expression the term a and ap- 


propriately grouping the result, show that 
aA, aA, 
Ox 02 


7A, 007A, . 7A, 
- (3 + aye + 3) [129] 


By means of Eqs. 60 and 74, this may be written 


curl, (curl A) =2( + ad + 


curl, (curl.4) = = (div A) — V?A, [130] 


in which V? is the Laplacian operator defined by Eq. 73. 

The y- and z-components of curl curl A are found in a like manner, or 
more simply through advancing the cyclic order of the variables x, y, z in 
Eq. 129. The result is 


‘, 9 .@ a ‘ 
curl curl A = (3 ne +j ay + 22) (div A) 
—V°(iA, +jAy+kA,) [131] 


which is 

curl curl A = grad div A — div grad A [132] 
or 

curl curl A = V(V- A) — V?4_ [133] 


It is interesting that although the gradient of the vector function A is 
not defined,* the Laplacian of A, or div grad A, is interpreted according 
to Eq. 131 as 


div grad A = V7A =i V7A, +7 V?A, +k VA, [134] 


Equation 127, subject to the condition 124, is now seen to yield the 
equation . 
VA = —J [135] 


relating the vector potential associated with B to its vortex distribution. 
The nicety of this result is that it is identical in form to Eq. 75 relating 
the scalar potential U to the source distribution p of a nonturbulent field. 
The only significant difference is that Eq. 135 involves vector functions 
whereas Eq. 75 involves scalar functions. This fact means that the vector 
equation 135 is equivalent to three scalar equations of like form, one for 
each of the three corresponding components of A and J. 


*An interpretation of the gradient applicable also to vector functions is given in Art. 15 
of this chapter. 
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14, THE POSSIBILITY OF A MULTIVALUED POTENTIAL FUNCTION 


It has been shown that if, throughout a given region, a vector field A 
is conservative so that curl A is identically zero at all points within this 
region, A may there be expressed as the gradient of a scalar potential 
function U. Furthermore, it has been pointed out that the existence of such 
a potential field is in general due to a distribution of sources which, ac- 
cording to the hydrodynamic analogy, are fluid-producing regions. One 
may be tempted to conclude, therefore, that a potential field can be due 
only to sources, and never to vortexes, because these latter are regions of 
turbulence and hence can produce only turbulent fields. 

A moment’s reflection, however, yields the thought that if a distribu- 
tion of vortex density J is confined to a finite region of space, only within 
that region is the curl of the associated vector function A different from 
zero. Everywhere else the curl of A is zero, and the field is, therefore, 
potential (that is, conservative) in character. 

A very important distinction, however, should be made between the 
conservative field which is due only to pure sources, and that which is due 
to vortex distributions restricted to certain excluded regions of space. To 
state the matter in another way, it is important to distinguish between a 
field which is conservative throughout al/ space and one which is conserva- 
tive only within certain portions of space, or to the exclusion of certain 
portions. The field which is conservative everywhere can have its origin 
in true sources only, whereas the one which is conservative within re- 
served regions (finite or infinite in extent) can be due to vortexes as well 
as to sources, although the former are confined to lie outside the reserved 
regions. : 

This distinction is concerned primarily with the nature of the associ- 
ated scalar potential function U. Thus, if the field is conservative 
throughout all space, or throughout a simply connected region 6 (see 
Fig. 14), the associated potential function U there is single valued; but 
if the region over which the field is conservative is a multiply connected 
one (Fig. 15), the associated potential becomes a multivalued function. 

A region is said to be simply connected if for every closed contour lying 
entirely within the region a surface bounded only by that contour can be 
constructed such that every point of the surface also lies within the region. 
The significance of this statement is best understood from an example of a 
region which is not simply connected. Thus the space outside a doughnut- 
shaped region is not simply connected because a closed contour which 
links the doughnut (passes through the hole) cannot form the sole 
boundary of a surface every point of which is required to lie outside the 
doughnut. If the doughnut is cut so that it no longer forms a closed ring, 
the space becomes simply connected. With the doughnut intact, the 
surrounding space is said to be doubly connected. 
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Another illustration of a doubly connected space is that surrounding a 
closed cylindrical surface of infinite extent like a straight tube which is 
infinitely long in both directions. Here a contour which encloses the 
cylinder cannot form the sole boundary of a surface which lies wholly 


any plane cross-section 
; of a simply connected 
region must be a closed 
curve as in “a’? it must 
not cross itself as in “b’’ 
a b 


Fic. 14. A simply connected region. 


outside the cylinder. If the space enclosed by the cylindrical surface is 
filled with vortexes whose cross-sectional distribution is given. by the 
vector density J (directed longitudinally), then, since by assumption 
J = 0 outside the cylinder, the associated field function A satisfies the 
equations , 

curlA = J [136] 


within the cylinder, and 
curl A = 0 [137] 


outside the cylinder. 
In the region outside the cylinder it is, therefore, possible to assume 


A = — grad U [138] 
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connecting canal 
or tube 


connecting tube d can be used instead of a but not both at the same time. 
the connecting tubes, of infinitesimal cross-section, are the 
so-called barriers of the region 


Fic. 15. A multiply connected region. 


With reference to Fig. 16, which shows a cross-section through the 
cylinder, the line integral of A formed for the closed path P, is evidently 
zero because it does not enclose the vortex region. This fact follows 
readily from Stokes’s law. However, for the closed path P2 the same law 
yields 


fp A-ds= fs-da=1 [139] 
P, 
But by Eq. 138 


¢ Adee ¢ (grad U) - ds [140] 
P2 Pz 
and hence it follows that. _ 
g (grad U)-ds = —I [141] 
Pa 


If the line integral is thought of as extending from some point @ on the 
path P, around the closed contour back to the same point a, this result 
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indicates that 


¢ (grad U)-ds = Us — Ua =1 [142] 


in which U, is the value of the potential U at the point a. The only 
possible conclusion to be drawn from Eq. 142 is that the potential func- 
tion U is not uniquely defined for points in the space surrounding the 


cylindrical surface 


C )s 
R, 


Fic. 16. One path of integration which encloses a vortex; another which does not. 


vortex 
density J 


a 


region enclosed by the cylindrical surface. Indeed, if the line integral is 
extended twice around the contour Pe, the result is 


U, — U, = 21 [143] 


and for ” circuitations 
U,-—U,=nl1 [144] 


The potential U at any point such as a clearly is defined only within an 
additive integer multiple of J. This is the nature of the multivalued 
potential function U in this example. 

The structure of the space surrounding the cylindrical region may be 
thought of as composed of leaves lying in the cross-sectional plane and 
winding about the cylinder as a winding stairway encircles a column. 
The density of these leaves longitudinally is infinite so that the process of 
encircling the cylinder any finite number of times around the contour 
P, without piercing any leaves involves no net longitudinal motion. Any 
point such as a is then referred to as 4, @2, dg, «+ - etc., according to an 
assumed numbering of these hypothetical leaves, and Eq. 144 may then 
be written more precisely in the form 


Vain — Ua, = nl [145] 

Although the multivalued potential function U may, through such a 
mathematical artifice, be transformed into a single-valued one, the rather 
complicated interpretation of the space outside the cylindrical region is 
given here principally for the purpose of lending visual clarity to the 
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concept of multivaluedness. A practical problem involving these concepts 
arises in the subject of magnetostatics where: the cylindrical region is 
identified with a current-carrying conductor, the density of the electric 
current being J, whereas the total conductor current has the value J. 


15. Tue DIFFERENTIATION OF SCALAR OR VECTOR FUNCTIONS WITH 
RESPECT TO THE TIME 


A given vector function A is considered to be a function of the space 
co-ordinates x, y, 2 and also of the time é, that is, 


A = A(xy,s,2) [146] 


The behavior of this function is studied at some point P in space, having 
the co-ordinates x, y, and z. More specifically, this point is denoted by 
P(«,y,z). This notation may refer to any specific point if proper values are 
assigned to x, y, and z. P is also referred to as “ the point of observation.” 

In considering the time derivative of A at the point P, or the rate at 
which the vector A changes with respect to the time at the point of 
observation, two possibilities must be distinguished, according to whether 
the point of observation is stationary or is in motion relative to the refer- 
ence co-ordinate system. In the former case, the variables x, y, and z are 
cqnstant, and in the latter they are functions of the time — that is, the 
point P moves with a velocity v given by 


v= iv, +juy+ ho, - [147] 
with 
dx dy dz 
vy, = di vy = di 0, = a [148] 


If the point P is stationary (x, y, z are assumed constant), the time 
derivative of A is understood to be the partial derivative 0A /at; if P is 
not stationary, the time derivative of A is given by the fofal derivative 
dA /dt. 

Identical remarks apply to the time derivative of a scalar function U ; 
which may, for example, be the potential energy of a particle located at 
the point P. If the particle is stationary, the potential energy may change 
with the time because of a time variation of the field in which it is em- 
bedded. This possibility is denoted by the partial derivative aU/at. If 
the particle is in motion, the met rate of change of its potential energy 
with respect to the time is denoted by dU/dt, and is in general due to the 
combined effects of the time variation of the field in which the particle 
is embedded and of the particle’s own motion through this field. 

Neglecting differential effects of higher order than the first, the total 


A 
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time derivative of the scalar function U is expressed by 
dU aU , Udx | Udy , Was 


dt oat | dxdt | adaydt | az dt [149] 


The first term on the right-hand side of this equation represents the time 
variation of U on the assumption that the particle at the point P is 
stationary. The sum of the remaining three terms represents the time 
variation of U due solely to the motion of the particle through the field 
which (as far as these terms are concerned) is constant at any fixed 
point but varies from point to point. The net time variation dU/dt 
appears as a linear superposition of these two separate effects because 
the contribution of higher order derivatives may (according to the 
principles of the differential calculus) be neglected. 

According to the definitions of the gradient and the scalar product, 
Eq. 149 may be written in the form 


— = —+0-gradU [150] 


in which the velocity v of the point of observation is defined by Eqs. 147 
and 148. If U is. not an explicit function of the time — that is, if Visa 
function of the space co-ordinates alone — 9U/dt = 0, and dU/dt is 
given by the second term in Eq. 150 alone. This term equals the com- 
ponent of grad U in the direction of v, multiplied by the magnitude of v. 
In other words this term equals the space rate of change of U in the direc- 
tion of v, multiplied by the magnitude of v. It is convenient to regard this 
as a resultant operation upon the function U by defining the operator 


ce) ce) 


0 
. (v- grad) = (v- V) = a ra + vy 5 + v, a [151] 
and by writing Eq. 150 
dU aU 


_ The vector v in the operator defined by Eq. 151 may be replaced by any 

vector function B, and (B- V) is then referred to as the ‘‘ B-gradient.”’ 
The symbolic operation (B-V)U is read: “The B-gradient of U.” 
Unlike the gradient of U, it is a scalar quantity. 

The convenience resulting from defining this operator is seen when the 
operator is used in connection with a vector function A. Whereas the 
gradient of a vector function cannot be interpreted according to the 
definition of this vector operation, the B-gradient is readily seen to be 
applicable to vector functions, and yields a vector, thus 


(B-V)A =i(B-V)A,+j(B-V)Ay+R(B-V)A, [153] 
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in which the x-component reads 


0A, 0A, oA 
5; eb 
ax toe ay 7 dz 


Similar expressions apply to the y- and z-components. 
The total variation of a vector function A with the time is, therefore, 
given by the expression 


(B-V)A, = B- VA, = B, 


dA 0A 

rE arr + (W-V)A [155] 
This is, in general, a function of the time. At any given instant, the vector 
(v- V)A represents the space rate of change of A in the direction of v, 
multiplied by the magnitude of v. To state the case in another way, 
(v- V)A is the vector change (increment) of A per unit displacement of 
the point of observation in the direction of v, multiplied by the magnitude 
of 2. 

A purely algebraic interpretation of the operation (v- V)A is obtained 

by letting 


(v0: V)A = G = iG, + 7G, + kG, [156] 
Then 


aw vert a ty +7 % = Ge 

aAy dA, dAy 

az + Ff Dy + aim Gy [157] 
0A, 0A, 0A, 


v v — 

Ox oy Oz 

The operation is thus recognized as amounting to a linear transformation 
of the vector v. If for abbreviation one writes 


3A 0A, OA 0A, 
Oe = Ory “ay” tty Oey = ae tty Caz = [158] 
then the matrix of the transformation is 
arr Agy Azz 
Gi] Gye. Gy Cys [159] 
Azx Azy zz 


The operation (v - V)A is thus regarded as a tensor of valence 2, which 
associates a vector G with a given vector v at any point in space. The 
elements of the matrix @, defined by Eq. 158, are the components of this 
tensor. Their values depend upon the vector function A. G is said to be a 
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linear vector function of v, the characteristics of this relationship being 
determined by the nature of A. 

Tn passing it may be of interest to note that the operations of forming 
the curl or a vector product may also be regarded as linear vector trans- 
formations. Thus the vector, curl A, is expressible as a linear transforma- 
tion of A with the skew-symmetrical operator matrix 


a 
dz oy 
ae ee [160] 
Ox 
F) a 
— Pa ae 0 


and the vector product A ~ B is expressible as a linear transformation of B 
with the skew-symmetrical matrix 


0 -A, ° Ay 

A, 0 -A, [161] 
= A y A xz 0 

The time derivative of the product of two vector functions is evaluated 


according to the familiar rule for the differentiation of a product. For the 
scalar product, the result reads 


d dA dB 

qe DG atsg [162] 
and similarly for the vector product 

d dA dB 


in which it is important to preserve the order of the two functions. 


16. ADDITIONAL USEFUL VECTOR RELATIONS 


Although the discussions of the preceding articles suffice to determine 
the results of more complicated operations involving combinations of 
scalar and vector functions, it is useful to have available several additional 
formulas covering those relationships which are encountered more fre- 
quently in practical problems. The following items are concerned with 
derivations of this sort. 

(i) The divergence of the product of a scalar and a vector. 
The scalar function is denoted by U and the vector function by A. 
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Then 
div (UA) =* (vA. +2 (04, +2 (04) [1068] 
™ ~ Oe ay "ay 7 


By the rule for the differentiation of a product, this is seen to yield 
div (UA) = U div A+ A- grad U : [165] 
(ii) The curl of the product of a scalar and a vector. 
Here the rule for the differentiation of a product may be expressed in 
the following way: 
curl (UA) = Vx (UA) = Vx (UA)y + Vx (UA)a [166] 
in which the subscripts mean that the functions U or A are to be treated 


as constants. According to the definition of the curl by Eq. 104 it is 
- readily seen that 


Vx (UA)y = U curl A_ [167] 


and 
Vx (UA)4 = —Axgrad U [168] 
Hence 
curl (UA) = U curl A — Axgrad U [169] 


(iti) The divergence of a vector product. 
This may be written 


div [AB] = v-[4«B] = V-[A«Bla+V-[AxB]p [170] 


the subscripts having the same significance as before. From Eqs. 26 and 
38, 


V-[Ax Bla = —A-[V«B] = —A-cul B [171] 
V-[A~Blz = B-[VxA] = B-curl A [172] 
so that 
div [A xB] = B-cwl A —A-cul B [173] 
(iv) The curl of a vector product. 
Here . 


curl [A x B] = V«[A«B] = Vx[AxB]la+V~[A~B]g [174] 
Applying Eq. 32 forthe triple vector product gives 
Vx«[Ax Bl, =A(V-B) -(A-V)B [175] 
and . 
Vx[AxBl]z = (B- V)A — B(V- A) [176] 
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The operations A-gradient and B-gradient are defined by Eq. 151 in the 
previous article. Thus 
curl [A x B] = A div B —- BdivA + (B-V)A — (A-V)B [177] 
(v) The vector product of a vector and the curl of another vector. 
Here Eq. 32 i§ applied to the triple vector product 


Axcurl B= Ax[V~x B] [178] 
the vector function A being treated as a constant. Thus one obtains 
Ax[V~xB] = V(A-B)4 — (A: V)B [179] 
‘or 
Axcurl B = grad (A-B)4 — (A-V)B [180]. 
(vi) The gradient of a scalar product. 
This reads 
grad (A - B) = grad (A- B)4 + grad (A- B)z [181] 
By Eq. 180, 
grad (A- B)4 = Axcurl B+ (A-V)B [182] 
grad (A - B)g = Bxcurl A + (B-V)A [183] 
Hence 


grad (A- B) = Axcurl B+ BxcurlA + (A-V)B+(B-V)A_ [184] 


(vii) The volume integral of the scalar product of a potential and a solenoidal 
vector function. 
A purely potential field is described by the vector function A, and a 
purely turbulent field by the vector B; that is, 


curl A = 0 [185] 


and 
div B =0 [186] 
throughout all space occupied by these fields. In the volume integral 
fia Ba [187] 


which is extended over all space, or over that portion to which the field 
B may be confined, the element of volume dy is represented by an ele- 
mentary length ds of any one of the closed tubes characterizing the flow 
map for the solenoidal field B. If the cross-sectional area of a flow tube 
is denoted by da, the integral 187 may be written . 


fe .B) dv = ffs da (A - ds) [188] 
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Here the vector element of length ds of a flow tube has the same direction 
as B. The integration with respect to ds extends around the closed circuit 
of a flow tube, and the integration with respect to da extends over all the 
flow tubes. 

According to the definition of a flow tube, |B| da is constant through- 
out the closed circuit mapped by this tube, so that this factor may be 
placed. before the integral with respect to ds. Hence 


fa-Ba= fia da fA ds [189] 
Since A defines a potential field, it follows that 
fA-ds=0 [190] 


so that the final result reads 


J (A+B) dv=0 [191] 


P (x,y,z) 
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17. THE VECTOR r 


In connection with many field. problems it is convenient to introduce 
a vector r which represents the vector distance between two points P 
and Q in space as shown in Fig. 17. The point Q may be the location of a 
cause (such as a source or vortex) whereas P is the point at which the 
resulting field is observed. The vector r is assumed to point from Q to P. 
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Hence if the co-ordinates of Q are denoted by , 7, ¢, and those of P by 
x, y, 2, the components of r are given by ; 


t, = (4% — é), Vy = (y ry 1), re = (2 —- g) [192] 


and the vector r is 


r=i(¢—£)+jy—1)+ke-f) [193] 
The magnitude of r is expressed by the scalar function 
r=V@-—E+0-2% + @— FP [194] 


In discussing various operations in terms of either the vector r or the 
scalar function 7, a distinction must frequently be made according to 
which of the points P or Q is considered to be variable. If Q is fixed and P 
variable, the operations of differentiation or integration apply to the 
co-ordinates x, y, z. This state of affairs may be indicated by attaching a 
subscript P to the operator in question. Similarly, a subscript Q indicates 
that P is considered fixed, and the variables é, y, ¢, are affected by the 
operator. For example, 


. 0 . 9 re] 
grade 9 ge ay! oe [195] 
whereas 


.o .o é , 
grado = ta tis th [196] 


: 0g oF 
Similar distinctions apply to the operator div and curl. 

Because of the symmetrical manner in which the variables x, y, z on the 
one hand and &, 7, ¢ on the other, enter into the expressions 192 for the 
components of r, the difference between a given operation as evaluated 
for the subscripts P and Q is-readily recognized. Thus, for the operations, 
grad, div, curl, for example, the result for a subscript P is simply the 
negative of that for a subscript Q. 

In the following more specific discussion, therefore, the variables are 
always assumed to be x, y, z. Without loss in the generality of this dis- 
cussion, the fixed point Q may then be assumed to be coincident with the 
origin of co-ordinates. This simplifies the expressions 192, 193, and 194 to 


te =X, y=, te = 8 [197] 
r= ix+jy + kez [198] 
and 


r= VP +P +H [199] 
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- In terms of these expressions it is readily seen that 
. or i or 1, be r 
gadr =i +i + eS = — Ga + jy + he) == [200] 


In other words, the gradient of the scalar function r is equal to a unit 
vector in the direction r. 

For the evaluation of the gradient of a function of 7, it is useful to 
observe that 


or or or 


rs) ‘ 
grad = (i x 45% + 8%) [201] 


Hence if f(7) denotes a function of r, and the partial derivative af/ar is 
written I(r), 


grad f(r) = f’(r) gradr = f’(r) ; [202] 
For example, if | 
| | f=" [203] 
then 
. grad r” = nr"*r ; [204] 
An application of this result to a commonly occurring form reads. 
_ grad () =— 5 [205] 
According to Eq. 60, the divergence of the vector function r is 
dive = 24 S84 [206] 
Oz 
and with Eq. 197 this gives 
divr = 3 [207} 
From Eq. 165 it follows that 
div (er) = 7" divr + r- grad 7” [208] 
By eeu from Eqs. 204 and 207, this yields 
div (r"r) = (n + 3)r” [209] 
This result, together with Eq. 204, shows that 
div grad r" = V7r" = n(n + 1)r"? [210] 


In particular, for n = —1, this gives the important result 
v? (*) =0 [211] 
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Evaluating the curl of the vector function r by substituting the com- 
ponents 197 into Eq. 104 shows that 


curlr = 0 [212] 
and the use of Eq. 166 then shows further that 
curl (r"r) = 0 [213] 
In particular, 
curl (4) =O [214] 


The vector function r, or (rr), may be looked upon as defining a pure 
potential field. 

A number of vector relations formed from the function 7 in combination 
with an arbitrary vector function A are also useful. Using the result 
. expressed by Eq. 184 together with Eq. 212 yields 


grad ©-A) =rxculA+(r-V)A+(A-V)r [215] 
But 
(v= (4,2 at bed + Aes os =A [216] 
so that 
grad (r-A) =rxculA+(r-V)A+A [217] 
. Equations 173 and 212 show that 
div [rx A] = ~r-curlé [218] 
Whereas Eqs. 177, 207, and 216 yield 
curl [rx A] =rdivA —2A — (¢- V)A [219] 


18, CURVILINEAR CO-ORDINATES 


Because of the geometry inherent in some physical problems, it is 
advantageous to designate various points in space in terms of co-ordi- 
nates other than the rectangular Cartesian ones which have been used in 
this discussion so far. In a problem which exhibits cylindrical symmetry, 
for example, it is usually effective to use cylindrical co-ordinates; if the 
physical problem has spherical symmetry, spherical polar co-ordinates 
are usually preferable, and so on. 

The variables occurring in connection with any of these curvilinear 
co-ordinate systems may be denoted by w,.v, w, as contrasted with the 
variables x, y, z, used for the rectangular Cartesian system. In the 
Cartesian system the curves for constant values of y and z, z and x, and 
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x and y form mutually orthogonal families (which in this instance are 
straight lines). For this reason, the rectangular Cartesian co-ordinates 
are said to be orthogonal. 

In a curvilinear co-ordinate system, the curves for constant values of 
v and w, w and u, and u and v may also form mutually orthogonal fami- 
lies, for example, in the case of cylindrical or spherical polar co-ordinates. 
The curvilinear co-ordinates are then said to be orthogonal. The dis- 
cussion of the present article is restricted to systems of this sort since the 
need for a more general system very rarely arises in practice. Moreover, 
the mathematical apparatus necessary for dealing with perfectly general 
curvilinear co-ordinates — the absolute differential calculus — requires a 
detailed study of considerable depth, which seems warranted only when 
sufficient use for it in connection with other practical problems presents 
itself. 

For the orthogonal curvilinear co-ordinates, a set of mutually orthogo- 
nal unit vectors 7, 7), 7 are defined which are analogous to the vectors 
i,j, k in the Cartesian system. The unit vector z,, at any point in space is 
tangent to the curve v = constant, w = constant, at that point. Simi- 
larly, 7, is tangent to the curve w = constant, u = constant, and i, is 
tangent to the curve « = constant, v = constant. The directions are, 
moreover, so chosen that 7, 7), 7 form a right-hand system. 

It should be observed at this point that the scale of length which is 
implied in the designation of 7,, 7), 7 as unit vectors is that pertaining 
to the rectangular Cartesian co-ordinate system. The scales for the 
curvilinear co-ordinate axes u, v, w at any point are in general different 
from this scale of length, and are moreover a function of the position of 
the point in space. In other words, the magnitudes of the increments 
du, dv, dw as measured in the Cartesian co-ordinate system are 


ds, = €, du 
ds, = €, dv [220] 
dSy = Cy dw 

in which e,, @y, €» are factors accounting for the differences in the scales 


for the «, v, and w co-ordinate axes at the point in question and the scale 


of length of the Cartesian system. 
A given vector increment of length ds is expressed in the Cartesian 


co-ordinate system. by 
ds =idx+jdy+kdz [221] 


and in. the orthogonal curvilinear system by 
ds = i, dsy + ty dsy + ty dSy [222] 
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- The length of the vector increment ds in terms of the Cartesian co- 
ordinates is determined by 


(ds)? = (dx)? + (dy)? + (dz)? [223] 
whereas in terms of the curvilinear co-ordinates 
(ds)? = (dsy)? + (dsy)? + (dsw)? [224] 
or ; 
(ds)? = e,7 (du)? + e,?(dv)? + ew? (dw)? [225] 


it being understood that this length is measured by the scale of the 
Cartesian system. 

The scale factors ¢,,, €v, €w, which in general are functions of the position 
of the point in question, that is, functions-of x, y, z or u, 2, w, may for 
given curvilinear co-ordinate system be found in several ways. In simple 
cases the expressions for ds,, ds», and ds, may be written down by 
inspection; whence the scale factors according to Eqs. 220 become 
evident. A more formal method of determining them follows. 

For any particular curvilinear co-ordinate system, the co-ordinates 
x, y, 2 are expressible as functions of w, v, w, thus 


x = fi (u,0,w) [226] ° 
i te (u,v,w) [227] 
a= fs (u,0,w) [228] 


According to the rules of the differential calculus, 


Bett + ed + Sa de = de 

2 ay += th 5 OO as af poe = dy [229] 
fs a uh = 

Pali rs dy + 27 dw = de 


or 


os 1 a 1a +2 1 


au Tae ee Eine Oe 


ti 2 ds. + 2 ts 2g a ? sy = dy [230] 


Su 


wy a i enak 
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In terms of the transformation matrices 


aft 


Ou. 


g =| oh 
Ou 


afs 
Ou 


and 


the relations 229 and 230 may be written in the matrix form 


and 


aft 
ov 
af, 
ov 
fs 
ov 


ah, 
OSy 
fa 
OSy 


fs 


dsSy, 
F,| dsy 
Las 


From Eqs. 220, it is clear that 


ds, | - [| ey 
ds, | =| 0 
dsy 0 


’ 


ro) 


and hence that 


®& 
e 


o 


eu 


0 
0 


Since both the Cartesian system 
orthogonal, it follows (see Art. 4, Ch. III) that the matrix F, in the 
transformation 234 is orthogonal, and hence that 


(Fa)e = Fe 


OSy 


af 


ow 
afe 


ow 


ow 


OS w 


dx 
dz 


ey 


0 


237 


[231] 


[232] 


[233] 


[234] 


[235] 


[236] 


and the curvilinear system are 


[237] 
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the subscript ¢ denoting the transposed matrix. But from Eq. 236 


é, O 0 
Fy =/0 Cy 0 (Fs)e [238] 
0 O ey 
and with Eqs. 236 and 237 this gives 
e O O 
FF = 0 6," 0 [239] 
0 O e,? 
Hence, by means of Eq. 231 it follows that 
eae aes) 
2_(W of2 3 
- (4) + au) Tou [240] 


oo ~ (BY + +(2) + +(#) [241] 
ORO Cm 


from which the scale factors are determined. 

As an illustration, the cylindrical co-ordinates 
may be considered, for which the z-axis is chosen 
coincident with the axis of cylindrical symmetry 
~ (Fig. 18). The relations 226, 227, and 228 then 
* read 


x= Prcos¢ . [243] 
Fic. 18. Cylindrical co- y=rsin¢ [244] 
ordinates with the z-axis =) 
as axis of cylindrical a= 2 [245] 
symmetry. 


The variables 7, ¢, 2 are chosen to correspond 
respectively to u, v, w. Then Eqs. 240, 241, and 242 yield 


e,2 = cos? ¢+sin?g+0=1 [246] 
e=rsin?d +r? cos?¢+0=77 [247] 
e2 =0+0+1=1 [248] 
Hence 
é=1, eg=r, @ =t [249] 


are the scale factors for this case. The vector increment of length is 
ds = i, dr + igr dp + i, dz [250] 
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that is, 
ds,=dr, dsy=rd¢, ds, = dz [251] 
and 
(ds)? = (dr)? + 7?(dg)? + (dz)? [252] 


In this simple example, the last three results can, of course, be written 
down at once from inspection of Fig. 18. 


Fic. 19. Spherical polar coordinates. 


The geometrical relations between spherical polar and rectangular 
co-ordinates are illustrated in Fig. 19. Accordingly, Eqs. 226, 227, and 
228 become 


x =rsin 6 cos ¢ [253] 


y=rsin@ésing [254] 
z= rcosé [255] 


The variables r, 8, @ are chosen to correspond respectively to u, v, w. 
Then Eqs. 240, 241, and 242 yield 


e,” = sin? @ cos? ¢ + sin? 6 sin? ¢ + cos?@ = 1 [256] 

ed” = 1° cos” 6 cos” ¢ + 1 cos*@ sin? ¢ + 7? sin?@ = 7? [257] 

eg” = r* sin? @ sin? ¢ + 7? sin? @cos?¢@+0=r?sin?6 [258] 
Hence 


ée=1, @=r, eg =rsind [259] 
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are the scale factors. The vector increment of length is* 


ds = i, dr + ier dd + igr sin @ de [260] 
that is, a 
ds, = dr, ds=rd9 dsg=rsin@dd [261] 
and 3 
(ds)? = (dr)? + r°(d@)? + 17 sin? @ (do)? [262] 


The most commonly used vector operations are the gradient, diver- 
gence, curl, and div grad, or V?. Formulation of these in terms of any 
orthogonal curvilinear co-ordinates u, v, w and specifically for the cylin- 
drical and spherical polar co-ordinates are discussed in turn. 

The expression for the gradient of a scalar function U reads 


aU aU aU 


grad U =i, get? get ore [263] 
or 
10aU 10U . 100 
grad U = i, “ e, OU prea 0. on en, OW [264] 


With the help of Eqs. 249, this becomes, for cylindrical co-ordinates, 
au |. 1aU _ , aU 


grad U = in aT rs + i, ae [265] 
and for spherical polar gota! substitution from Eqs. 259 gives 
1aU 1 aU 
grad U = i, — tae tee [266] 


For the formulation of the general expression for the divergence of a 
vector function A, the definition given by Eq. 56 is applied to acurvi- . 
linear parallelepiped (Fig. 20) enclosed by the surfaces u = constant, and 
u + du = constant; 2 = constant, and v+ dv = constant; w = con- 
stant, and w + dw = constant. Except for differentials of higher order, 
the elementary areas of opposite faces of this parallelepiped are equal. 
Specifically, the elementary areas of the faces normal to tus ty, ty are 
given respectively by 


day, = dsy dSy = Cr€w dv dw [267] 
day = Sy dSy = Cyl, dw du [268] 
day = ds, dsy = Cyey du dv [269] 


*In this example, one may also write down the expression for ds directly and thus obtain 
the scale factors without applying the formal method. 
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The volume of the parallelepiped is : 
dV = dsy dsy dS = Cy€rCy du dvdw — [270] 

The surface integral of A is given by 


fA +da = [AydayJusau — [AudauJu + [Avdas]oyav — [Avdar]y 
+ [Aw daw) vido —[Auday]y [271] 


Fie: 20. A curvilinear parallelepiped formed by surfaces of constant.#, v, and w. 
in which A,,, Ay, Ay are the components of A in the directions of 7,, Zp, ty 
respectively, and the subscripts indicate that the quantity enclosed by 
the brackets is to be evaluated at the points u + du, v, w; u, v, W; us, 
v + dv, w; u,v, w; etc. Now 

| ‘ 
so that 


Wg 2 a 
[Au day)updu pa [Au daulu = au [Au da, udu 
c= = [Aveewlududvdw [273] 
and so forth. Hence 


0, F) FY 
Ay Pee hae er Ged a ; 
fA da | pq Auevew) + pp (A veweu) +37 (A veste)| du dudw [274] 


v 
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so that the general expression for the divergence becomes 

SA cae A-da 
dsy, ds, dS, dS. dSuy 


u i= (Aulvew) + — Teen reat [275] 
U 


ye vow 


div A = 


In cylindrical co-ordinates, this reads 


div A = ar (A,r) Trae (Ag) + — 2 (Aw)| [276] 
or 
, 10a 1 0Ag 0A, 
div A = ; ar (rA,) + F ae + 3g [277] 


In spherical polar co-ordinates, 


1 
div = perite (A,r sin 0) + 2 (Aw sin 8) + = 2 Aw) [278] 
or 
diva = = = (PA )+— 2 jena ade [279] 
ae rsin§ a¢ 


The expressions for the components of the curl of the vector function 
A are obtained through applying the defining relation 89 to elementary 
curvilinear rectangles lying in planes normal to the directions of 7, 4.,. 
and i. Except for differentials of higher order, the opposite sides of such 
a rectangle are equal. Considering the u-component, the sides of the 
rectangle are ds, = e, dv and dsw = Cw dw. The line integral of A taken 
around this rectangle is expressed by 


GA-ds = [Avdslw + [Av dsulotae — [Av dsopte ~ [Aw dsu}> [280] 
But 

[Adland + 2 [Aw ds)» do [281] 
and 


(Avdelase =A dele a [Ay diy] » deo [282] 
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so that Eq. 280 becomes 


a t) : 
GA ds = — [Ay dsv]o dv —— [Ay dso] do [283] 
or 
fA ds = {2 (Ace) — 2 Avs)eh dod [284] 
a =. au wlw)y aw wu) w Ww 
The enclosed area is da, = e,€y dv dw. Hence by Eq. 89, 
1 {a ts] 
curl, A = ae (2 (A wew) — roe (Aves)} [285] 
and similarly 
1 0 0 
curl, A = ve ea (Aueu) — os (Awen)| [286] 
1 
curly A = {2 (4) ~ 2 (Avea)} [287] 


In cylindrical co-ordinates, these expressions become 


ak he = - fas | [288] 
curly A = a = on [289] 
curl, A = ie (rA 4) et [290] 
and in spherical polar co-ordinates, they read 
ee r ae 6 30 Bro = r in 0 x [291] 
curl, A= Pint 36 zs ta (rA4) [292] 
curl, A = —— (VA 0) — ; ee [293] 


The evaluation of VU is obtained through substituting the components 
of grad U from Eq. 264 for the components of A in Eq. 275. This gives 


1 8 [erew O 0 feye, OU 0 (eye,dU 
2 ee icant [rnc ww OU MP oer, oe uve 
vee Cyl rw ‘e( Cu ov) 7 =( Cy aa a 2(* | [294] 
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For cylindrical eens this yields 
_1a/( aU 10°U aU . 
oop ar +3 ar Oe re [295] 
In spherical polar co-ordinates, 
1o aU 1 0U 
arr = 2 (204 
veo ae(r oe) + ana aoe sino + sin? 6 29 ag? [296] 


The first term on the right-hand side - this equation may be written in 
other forms by noting that 


_ 8, 230 _ 12) 
r ral ) ee ror or or [297] 


PROBLEMS 
1. Let A, B, C be three vectors extending from a fixed point, O, to the points 


a, b, c, respectively. Express the directed segments mr ad ac ac in terms eee B, C. If 6 
lies on the line ac and divides it in the ratio 
ab — af 
—=MmM “8 
ac 


te | 


write the expression for B in terms of A, C and m. Conversely, if B= mA + nC, 
where m and m are scalars such that m + = 1, show that 5 must lie on ac and find'the 
ratio 


ab 
ae 


2. Let A, B, C, D be four vectors extending from a fixed point O to the points 
a, b, c,d, respectively. As in Prob. 1, show that the necessary and sufficient conditions 
yielding a, 6, c, d coplanar are expressed by the relations 


a+B+y7=1 
 ‘D=ad +BB+y7C 


in which a, 8, y are scalars. 
3, The vectors from the origin to the points a, b, c, d are 


A=itjtk 
B=i2+93 
C =13 +75 — k2 
D=-j+k 


Express the vectors ab, be, ed, dai in terms of the unit vectors i, 7, &. Show that ab and 
Pe: are parallel. 
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4, Prove that the absolute value or length of a vector A is given by 
A=VA-A 
Find the lengths of the vectors A, B, C, and D in Prob. 3. 
5. Evaluate the following scalar products: 
(i) G3 +J4 + #2)- (j — 85) 
Gi) @ +32 — k3)- G4 +7 + 2) 
(iii) (450 + 235) - (—13 — 717 + k2) 
6. Verify Eq. 3, page 188, for the case where 
A=12+ 73 -—k 
B=-i-j+B 
C=i-j2 eye | 
D=-i24+j7 +8 
7. Find the cosine of the angle between the vectors 
A =i-—jy—h2 
B=i24+j — k2 


8. The sides of a triangle are vectors A, B, and C, of length a, b, and c, respectively, 
such that 


A=B-C 


By using scalar multiplication, square both sides of this equation and, by interpreting 
the result geometrically, obtain the law of cosines. 


9. If A and B are nonzero vectors, show tliat the necessary and sufficient condition 
for A and B to be: 


(i) parallel is AxB=0 
(ii) perpendicular is A- B = 0 
10. Evaluate the following vector products: 
(i) @2+j2- 2) G+) 
Gi) @S —j + k)x G2 —7 + k2) 
Gi) G@+j+k)*@ +7 + 22) 
11, If A and B are sides of a triangle, show that the area of the triangle is given by 
area = $V (Ax B)- (A xB) 


‘Calculate the area when 


A=it+tjtk 
B=i2+4+k3 
12. Prove Lagrange’s identity 
(A-C) (A-D) 


(Ax B)+(CxD) = (B-C) (B-D) 


Use this to find an alternative form for the area formula given in Prob. 11, 
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13. Using the vectors given in Prob. 10 (iii), compute V., V,,.V., and the projec- 
tions A), AZ), AG of Eq. 14, page 191. Find the angles 0,,, 0,2, 9,y, and in terms of 
these results verify Eqs. 16, 17, and 18, page 191. 


14. If A, B, C are coterminous edges of a tetrahedron, show that the volume of the 
tetrahedron is given by 


volume = }|A x BxC| 
Calculate the volume when 


15. Show that 
(Ax B)x(CxD) = (A*B-D)C —-(AxB-C)D 
=(AxC-D)B —(BxC-D)A 
16. Prove the formula 
(A x B)+ (BxC)x (Cx A) = (AxB-C)? 


17, If U = x? + 4y? 4+. 162°, find grad U at the point P(2,2,1). What is the shape 
of the constant value surface through P? Find the directional derivative of U, 


aU 
Os 
‘at P, if the direction of s at P is along the vector 
A= —-i+j2+h2 
18. Verify Eq. 48, page 198, if 
U =22 +42 
V = xyz 
19. Find a function U (x,y,z) with gradient equal to 
grad U = idx 4 j5y* + 423 
20. In fluid dynamics, there arises the energy function 
$(x,y,2) = Jt “de 
nd 


where the density of the fluid p(x,y,z) is a function of the pressure (x,y,z). Find the 
general expression for 


grad 
If in particular 

b=? +y? +27) -—§ 
and 


o=cb (c, a constant) 
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- show that 


cuts Lae 
21. Let U = x? + 4y? 4 162? define a potential. Calculate the line integral 
J grad U- ds 
from P;(1,1,1) to P2(2,2,2): 


(i) By integrating along straight lines from (1,1,1) to (2,1,1) to (2,2,1) to (2,2,2). 
(ii) By integrating along the straight line P1P». 


22. Calculate the divergence of the following vector fields at the points indicated. 
Which of the points represent sources? Sinks? 


(i) A = a(x? — y?) +7 (@? — 2) + ky? — 2%) at (2,3,1) 
(i) A = (ix + jy + kz) (x? + y? + 2%) at (—1,2,2) 
(iti) A = (432 — 78x + k5y) (xy) at (1,0,0) 
23. By considering the defining equation 56, page 201, calculate div (ix. + jy + kz) 


at the origin, using a sphere of radius Ar, with center at the origin, as the infinitesimal 
volume. 


24. Determine the volume of fluid per second flowing out of a spherical region of 
radius 3 feet if the vector velocity field for the region is given by 
V = ix? + jy? + kz? feet per second 
referred to a set of axes with origin at the center of the sphere. 
25. Calculate the volume of fluid per second flowing into a cube measuring two 
feet along each edge if the vector velocity field is given by 
V = —ix3y? + jy’z + kz’ feet per second 
referred to a set of axes with origin at the center of the cube and such that the x-, y- 
and z-axes are perpendicular to a face of the cube. 
26. Show that the volume enclosed by a closed surface S is 


volume = 4 i V(r?) - da 


where r is the distance from the origin to the variable point; that is, 
r= Vx? + y? + 22, 

27. Find the potential U(x,y,z) associated with a vector force field which is 
directed toward the origin, with magnitude inversely proportional to the distance 
from the origin. (Hint: By symmetry, U will be a function only of r, the distance from 
the origin.) 

28. The velocity of a fluid is radially outward from a point source and is propor- 
tional to the distance from the source. Find the velocity potential associated with this 
vector velocity field. 


29. A sphere of radius a with center at the origin contains a space charge of density 
p =a? — 7? (wherer = Vx? + y? + 2?) 


There are no charges exterior to the sphere so that the resulting field will possess 
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spherical symmetry, that is, 
A=f(r)r - (wherer = ix + jy + kz) 

Using Eq. 64, page 205, on a sphere of radius r, evaluate f(r) and thus show that 

8r a> 

rt 


A= (when r > a) 


and 


at 7% | 
A=4n>—-%F (when r < a) 


* 30. In the preceding problem the potential U associated with the field A may be 


coieputed from 
U= f A-dr 


(This may be aie of a as the work done by the field when a unit charge, upon which 
it acts, moves from 7 out to infinity. ) Calculate the expression for U vend outside the 
sphere and show that U satisfies 


VU = 0 


Also find the potential U valid for pe inside the sphere aad alice that here U 
satisfies 
VU = ee 


31. On the surface of a sphere of radius a and center at the: origin is a surface 
charge of constant density o. By the same procedure used in Probs. 29-and 30, de- 
termine the vector field A, and the scalar potential U, for points interior and exterior 
to the sphere. Show that ‘Laplace’ 's equation is satisfied for all points except on the 
surface of the sphere, and that on the surface itself, Eq. 68, page 206, is esane: 


32. Calculate the curl of each of the following vector fields: 
(i) A = —ty + je 
(ii) A = iBxy + j2yx + kyz 
| Gil) A = tye $j? +9? +) Haw] ty +8) 
33. Verify Stokes’s theorem by computing separately 


_ and 


in the case where 
‘A = ty-— je 


and the surface 5 is the hemisphere of radius b, above the xy-plane, whose boundary: 
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L is the circle 
x2 + vy? = b2 
in the xy-plane. 
34. Evaluate 


pads 


around a square of side b, which has a corner at the origin, one side on the x-axis, and 
one side on the y-axis, if 


A = i(a? — y?) + flay 


35. Consider an infinitely long, straight vortex thread of constant moment J. By 
using Eqs. 102 and 103, page 213, together with the fact that the resulting field will 
possess cylindrical symmetry (that is, the lines of force will be circles about the vortex), 
deduce the expression for the vector field A. Use cylindrical co-ordinates with the 
z-axis along the vortex. 

Show that the potential U which satisfies Eq. 130, page 220, is given by 


U = 21) — 0) 


where @ is the angle in cylindrical co-ordinates. Verify that this is a multivalued 
potential function which satisfies Eq. 136, page 222. 

36. Consider an infinitely long right circular cylinder of radius 6 whose axis is the 
z-axis of cylindrical co-ordinates. Interior to the cylinder is a vortex density J defined 
as follows; the direction of J is always parallel to the z-axis and the magnitude is 
given by 


Fe 
The exterior of the cylinder is assumed to be vortex-free. Using the same sort of 
reasoning as in the preceding problem, determine the vector field A, obtaining ex- 


_ pressions valid inside and outside the cylinder. For the inside of the cylinder, find the 
vector potential P which satisfies © 


a A curlP =A 
div P =0 
(Hint: From Eqs. 126 and 127, page 219, it can be seen that the direction of P is the 


same as that of J.) 
For the outside of the cylinder, find the scalar potential U which satisfies 


grad U = —A 


37. A point P(«,y,z) moves along a space curve in such a way that its co-ordinates 
are given by the following functions of time: 


x = acost 
y =asint 
z= bt 


Find the vector velocity and acceleration of the point P, 
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38. The potential energy of the region in the preceding problem is given by 


= an coe sin ¢ 
a by? 
Find dU/dt for the point P(x,y,z) of Prob. 37, at time z. 
39. With reference to Probs. 37 and 38, let A = — grad U. Determine dA /dt for 
the same P(x,y,z) at time ?. 


40. Let 
y atte 
2 
A = i(x? + yz) + j(y? + 2x) + ke? + xy) 
B = i(xy) + j(yz) + k(x) 
Calculate: 


(i) grad U 

Gi) div A, div B 
(iii) curl A, curl B 
(iv) div (UA) 

(v) curl (UA) 
(vi) div (A x B) 
(vii) curl (A x B) 

(viii) grad (A + B) 


41. If A is a constant vector, prove 
V(A-r) =A 
42. Let Vy and Ve be vectors from the fixed points (%1,1,21), (2,¥2,%2) to the 
variable point (x,y,z). Show that ; 
(i) div (V1 x Vo) =0 
(ii) curl (Vi = Ve) = 2(Vi — Ve) 
(iii) grad (Vi- Ve) = Vit Vo 
43. If A is an arbitrary vector field, evaluate 
(AxV)-R 
and 
(AxV)xR 
44. Obtain Eq. 287, page 243, directly from 


eu aU aU 


VU (x,9,2) Fo. Ox? + Oy? + O22 


by the substitution given by Eqs. 235 to 237, page 237. 


45. Obtain Eq. 288, page 243, directly as in Prob. 44 by means of the substitution 
given by Eqs. 245 to 247, page 238. 
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46. Suppose that ‘ 
vy =v(@1) 
ay = ¥i(@e) 
ae = Yo(as) 


a series of relations through which y is expressed as a function of u, v, w. Show that 
OY aida,  Oan-1 


47. Apply the formula of the preceding problem to compute the gradient of 
; y = Insin ¢iinh (A’B) 
in which 
A= yy + tye + iwag 
B=iyu + inv + yw 


the quantities @1, d2, a3 being constants. 

48. Consider a rectangular co-ordinate system with origin at the point O and let 
Q and P be two other points separated from O by the distances p and R respectively. 
Let r denote the vector distance from Q to P (as defined in Art. 17) and similarly let 
p and R denote the vectors OQ and OP. The angle between the latter two vectors is p. 
If Q is assumed fixed while P moves upon the surface of a sphere with radius R, 
compute the magnitude and the direction cosines of the vector gradp cos y. Show that 
this vector is perpendicular to R. 

49. With reference to the previous problem compute the magnitude and direction 
cosines of the vector gradg cos y when P is fixed and Q moves over the surface of a 
sphere of radius p. Which vector, p or R, is perpendicular to gradg cos y? 

50. If (r) is some scalar function of the distance as defined in Prob. 48, show that 


gradp =F grader = — gradgr 


Compute the magnitude and the direction cosines of this vector if 


e7dkr 
o = - (Rk being a constant) 


_ 51. Referring to the statement of the previous problem, show that 
Vo'b = Vr's = — b6 
52, If a is a constant vector and ¢(r) a scalar function of r, show that 
Vox (ab) = axVep 


and 
Vax Vax (ab) = Vo(a- Vad) — aVob 


53. Prove the vector identity 
1 
E A [Ax ExA + (EZ: A)A] 


in which Z and A are any vectors. 


| 
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54, Prove the follqwing vector relations: 
E-(VxVxao) = pH: a+ E-V(a- Vo) 
E-V(a- Vo) =V- [(a- Vb)E] — (a: Vb)(V- £) 
E. (VxVxad) =a- {RE — (V- EVO} +V-[(a- VO)E] 


in which E is a vector function of x, y, 2, @ is the scalar function defined in Prob. 50, 
and a is a constant vector. 

55. If A and B are any two vector functions of the space co-ordinates, show by 
means of Gauss’s law that 


[Be V1Vd A Weve B) a= f [4x VxB — Bx Vx A] da 
pe 


in which da is a vector element of the surface S and dv is an element of the enclosed 
volume V. 


CHAPTER VI 


Functions of a Complex Variable 


It is assumed that the reader has some acquaintance with the subject 
of complex numbers and the representation of them in the complex plane 
‘(also known as the Gaussian plane). Here it is customary to consider the 
x-axis as the “‘ axis of reals ” and the y-axis as the “ axis of i imaginaries.” 


The symbol j = /—1 prefixed to a real number signifies that the latter 
is the imaginary or y-component of a given complex number. The y-axis 
is, therefore, sometimes referred to as the “ j-axis.”’ 

A complex number z = x + jy is plotted in the complex plane as the 
point P@,y), and z is interpreted geometrically as a vector* drawn from 
the origin to this point. This interpretive procedure is referred to as the 
rectangular representation of the complex number (or vector) z. Its 
polar representation is given by z = pe’®, whence p = Vx? + y*, and @ = 
tan! (y/x). The familiar law of parallelogram addition applies to the 
addition of any given set of complex numbers. The details of this process 
_ as well as those involved in the multiplication and division of complex 
numbers are not further elaborated upon here. 

The object of the discussion in these pages is rather to give the reader 
some acquaintance with a complex function whose independent variable 
is a complex number. The most obvious novelty exhibited by the function 
of a complex variable, contrasted with a real function of a real variable, 
lies in the fact that the values of both the function and the variable 
are no longer characterized by single numbers; two numbers are now 
required for the specification of each. Thus a value of the variable z in- 
volves the specification of the two quantities x and y, and since the 
function of z is likewise,complex, its values also involve the specification 
of a real and an imaginary part. 

In view of this situation it is clear that the process of graphical rep- 
resentation requires new methods in the case of complex functions. In 
addition, one should carefully review the fundamental operations of 
the differential and integral calculus in order to see whether their familiar 
interpretation may be extended in some way to apply to complex func- 
tions. This extension should be made with the minimum possible change 
in basic conceptions in order that the many useful relations known with 


*Complex numbers are frequently referred to by electrical engineers as vectors. It has 
been pointed out that the term “ vectors ” is in this connection misused and that such misuse 
may lead one to draw false conclusions or otherwise fall into dangerous byways of thought. . 
This view is not shared in this book, principally because the discussion given in Art. 5 shows 
that it has little justification. 
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regard to functions of a real variable may become available also to the 
manipulation of the complex ones.* 

The most useful operations familiar to the reader in connection with 
functions of a real variable are those of differentiation, integration, and 
expansion in series. A discussion of the extension of these conceptions to 
functions of a complex variable is the principal aim of the following 
articles. 

In these discussions no particular attempt is made to give rigorous 
proofs. The derivations are given entirely for the purpose of providing 
the reader with a partial insight into the relevant fundamental inter- 
relationships. Wherever possible, an attempt is made to establish contact 
between the new conceptions and those with which the student has an 
acquaintance of longer standing. To the engineer who is trying to gain a 
working knowledge of function theory, rigorous proofs are a waste of 
effort, but plausibility arguments do serve a useful purpose in that they 
provide the necessary circumspection for facile and intelligent use of this 
important mathematical tool, and at the same time lay the groundwork 
for a more thorough study which may seem desirable at a later time. 


1. DIFFERENTIATION 


The complex function of a complex variable z is denoted by 


w = f(z) [1] 
with 
z=a+ijy [2] 
and . 
w= ut jv [3] 


Therefore # and v are functions of both « and y; thus 
u=u(xy), v= v(a,y) [4] 


It should be observed that both the real and imaginary parts u and v 
are real functions of the two real independent variables x and y, which 
are the real and imaginary parts of the complex variable z. The language 
used here may be somewhat confusing to the reader, inasmuch as v and y 


*This approach to the consideration of the theory of functions of a complex variable may 
appear to some readers to be somewhat strange. It should be remembered, however, that 
these pages are addressed primarily to the engineering student whose previous experience 
with mathematics has been confined almost wholly to real functions of a real variable. To him 
the process of regarding the present discussions as an extension of some of the manipulations 
which apply to functions of a real variable not only appears to be sensible but also is the 
course which his process of learning will take in any case. 
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are spoken of as the “ imaginary ” parts of w and z respectively, whereas 
at the same time they are pointed out as being real quantities. The 
strangeness of this method of expression should, however, readily be 
overcome by concentration upon its mathematical significance as ex- 
pressed by the relations 1 to 4. 

It is now of interest to examine whether the derivative of the function 
w with respect to the variable z may be interpreted as the limit of the 
ratio Aw/Az, in which Aw and Az represent corresponding increments, as 
the increment Az approaches zero. Reflecting upon this situation at once 
discloses an apparent difficulty, since one is reminded of the fact that 


Az = Ax +7 Ay. [5] 


and hence that Az may be interpreted in an infinity of ways. If one 
assumes for the moment that the increment Az has a fixed magnitude, 
its direction in the complex plane may be varied in an infinite number of 
ways, thus yielding an infinite number of corresponding increments Aw 
in the function. It does not necessarily follow, of course, that the ratio 
Aw/ Az correspondingly assumes an infinite number of values, but unless, 
in the limit Az — 0, this ratio is independent of the direction of Az, the 
derivative of the function w evidently does not possess a unique value. 

’ Whereas it is conceivable that the extension of the usual conception of a 
derivative to functions of a complex variable may require a distinction 
with regard to the direction assigned to the increment Az in the complex 
plane, the simplicity and general usefulness of this derivative would 
unquestionably be greatly impaired if its value were subjected to such a 
restriction. The undesirability of the latter suggests that one ought to 
demand that the derivative operation be completely free from this 
restriction, and then inquire into the bonds which are thereby laid upon 
the nature of the complex function. If these are not so severe as to rule out 
of consideration such classes or kinds of functions as one would like to see 
embraced by the theory which is the object of this discussion, one may 
still be served by pursuing it. 

As will be seen shortly, it turns out that this point of view may well be 
taken and, surprisingly enough, that more functions out of a set written 
down at random fall into a classification which meets these bonds than 
_ one might at the outset expect. In fact, the results are so gratifying in this 
respect that one is justified in ruling out of further consideration all 
complex functions which do not comply with this demand and in stipulat- 
ing that the term “ function of a complex variable ” shall apply only to 
those that do. 

The conditions under which the derivative of the function w with 
respect to the complex variable z may be independent of the direction in 
which the increment Ag is taken are readily found by first indicating the 
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derivative by means of the relations 2 and 3 as 


dw du+jd [6] 
, dz dx+ jdy 
Now from Eq. 4 
Ou ou 
an REE ad [7] 
and 
ov ov 
idea ee aa [8] 
Substituting into Eq. 6 and writing the result in the form 
ou ov ou dy 
dw aa + ax +(* +9 2 
SS [9] 
dz 1+ . dy 
Ia dx 


show that the direction of the differential increment dz is determined by 
dy/dx, and hence, if the expression 9 is to be independent of this direction, 
that the necessary conditions are expressed by 


ou . ov 


i hae 
ox (1 
au, , 00 9 [10] 
ay | 7 ay 
or 
Ou . ov ov . Ou 
ay nd ay = et oe [11] 


en oy [12] 
and 

ou ov 

ay ee [13] 


These are the conditions which the real and imaginary parts of the com- 
plex function w = f(z) must fulfill in order that its derivative may have 
a unique value for any point z, regardless of the direction of the increment 
dz at this point. Equations 12 and 13 are known as the Cauchy-Riemann 
partial differential equations (or condition equations). Only those functions 
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w =u -+ jv which satisfy these equations are henceforth to be called 
functions of a complex variable. 

Practically all the common functions familiar to the reader are found 
to satisfy the conditions 12 and 13. The simplest of these functions is 


w=Z [14] 
for which 
“=x and v=y [15] 
is obviously a function of a complex variable. 
A constant times an integer power of z, namely 
w = az" [16] 
is likewise seen to satisfy Eqs. 12 and 13. Hence it follows that any 
polynomial 


W = On2" + dn_12™! +--+ + a3 + a [17] 
or any quotient of polynomials : ; 
_ On" + dy 18"* + +++ + az + dy 


aa et RE eT ERENT CPT [18] 


are also functions of a complex variable. The familiar trigonometric, 
hyperbolic, and exponential functions as well as the logarithm, when 
regarded as functions of g, all satisfy Eqs. 12 and 13. Fractional powers of 
z and fractional powers of polynomials in z satisfy the conditions. It is, in 
fact, more difficult to find functions that do not satisfy the conditions 12 
and 13 than it is to find those that do. A few exceptions are 


w=l=VeP+y¥ [19] 
and 
w=2=x—Jy [20] 


but even these simple exceptions are rather peculiar and hardly worth 
bothering with anyway. 

It is appropriate to point out that the fulfillment of the Cauchy-Rie- 
mann equations does not suffice for the existence of the derivative. The 
latter requires that the partial derivatives 12 and 13 be continuous func- 
tions of x and y in the vicinity of the point in question. 

A point at which the function is not differentiable is called a singularity. 
If the function is differentiable everywhere within an arbitrarily small 
region in the vicinity of some point, it is there said to be regular or 
analytic (the term holomorphic is also used to describe this property). A 
region throughout which a function is analytic is spoken of as a region of 
analyticity. 
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2. A GRAPHICAL REPRESENTATION; CONFORMAL MAPPING 


Since the complex function w as well as the complex variable z require 
two quantities (their real and imaginary parts) for their description, the 
graphical representation used for functions of a real variable is not 
available for the plotting of functions of a complex variable. Instead, the 
values of the variable z are plotted in one complex plane (the x,-plane or 
z-plane), and the corresponding values of the function w are plotted in 
another complex plane (the ,v-plane or w-plane). A given point in the 
z-plane represents a complex value for the independent variable z which 
determines a value for the function w, and this value in turn determines a 
corresponding point in the w-plane. A continuous curve in the z-plane may 
be thought of as defining a set of points in this plane, and if the function 
w = f(z) is continuous throughout this range of z-values, a corresponding 
continuous curve is thereby determined in the w-plane. 

The construction of a family of curves throughout a given region in 
the z-plane makes it possible to map the behavior of the function over a 
corresponding region in the w-plane. For this purpose the curves drawn 
in the z-plane may, for example, be the sets of straight lines defined by 
x = constant and y = constant, the constants being chosen so that these 
orthogonal families of lines form a uniform grid. Alternatively, a set of 
circles concentric with the origin and the orthogonal set of radial lines 
may be drawn in the z-plane, or one may select any other sets of curves 
which appear best to serve the purpose in view of the particular function 
under consideration. 

It is useful to note a very interesting property of functions of a complex 
variable which is made evident by such a graphical representation. This 
property follows directly from the fact that the derivative of the function 
is independent of the direction of the vector increment dz in the z-plane. 
Assuming for the moment that the increments are finite, the derivative 
is approximated by Aw/Az. If the increment Az is thought of as having a 
fixed magnitude but any desired angle, the fact that the quotient Aw/Az 
has the same complex value regardless of the angle of Az means that 
variations in the angle of Aw are exactly equal to any assumed variations 
in the angle of Az. In other words, since the angle of the complex quotient 
Aw/Az is independent of the angle of Az, the changes in the angles of Az 
and Aw must always be equal. 

Any two increments A,z and Agz differing only in direction at a given 
point in the z-plane may be looked upon as a pair of path increments 
along any two curves which intersect at that point; and, similarly, the 
two corresponding increments A,w and A,w (whose magnitudes are alike 
because A,w/A,z = Agw/Agz) may be looked upon as a pair of path 
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increments along two curves intersecting at the corresponding point in 
the w-plane. Since the angle between the path increments A,;z and Agz 
equals the angle between the corresponding path increments A;w and 
A,w, any sets of curves drawn in the z-plane intersect at the same angles 
as the corresponding curves in the w-plane at all corresponding points. 
The process of mapping curves in the w-plane corresponding to any 
chosen curves in the g-plane, in other words, preserves the angular rela- 
tionships between these curves at all corresponding points. For example, 
if two sets of curves drawn in the z-plane form orthogonal families, the 
corresponding sets of curves which are mapped in the w-plane by means 
of any function w = f(z) (satisfying the conditions 12 and 13, of course) 
likewise form orthogonal families. 

‘It should be observed that if the angle of the increment Agz is larger 
(or smaller) than the angle of A,z, the angle of Agw is likewise larger 
(respectively smaller) than that of A,w. In other words, the angular 
increment between A,w and A gw is equal to the angular increment between 
Az and Agz not only in magnitude but also in sense. That is, if the rotation 
from A,z to Az in the z-plane is, for example, counterclockwise, the rota- 
tion from A,w to Agw in the w-plane is also counterclockwise. 

If w = f(z) is a function satisfying the conditions 12 and 13, the func- 
tion @ = f(z), in which the bar indicates the conjugate value, evidently 
does not satisfy these conditions. Since, for a given Az, the increments 
Aw and Aw have the same magnitudes but opposite angles, it is clear that 
the function # = f(z), in its mapping properties, preserves the angular 
relationships in magnitude but reverses their sense (as is the case with a 
picture and its mirror image). The mapping properties of both the func- 
tions w = f(z) and w = f(z) are said to be isogonal (meaning that the 
magnitudes of angular relationships are preserved). In addition, the 
mapping property of the function w = f(z), which preserves the sense as 
well as the magnitude of angular relationships, is described as being 
conformal. 

As a consequence of the property of conformality, one may see that if a 
small region of a map in the z-plane (with numerous intersecting curves) 
is compared with the corresponding small region in the w-plane (with 
numerous corresponding intersecting curves), these two small mapped 
regions are found to be exact replicas of each other except for a factor of 
magnification (or diminution) equal to the magnitude of Aw/Az at the 
point where this region is located, and a rotation through the angle of 
Aw/ Az. This observation is strictly accurate, of course, only in the limit 
as the size of the entire region tends to zero, but for small regions of finite 
size, the two corresponding maps are very nearly alike in detailed form. 
The term “ conformal ” is thus seen to assume a clearer significance. 
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3. THE INVERSE FUNCTION 


The corresponding maps in the w- and z-planes for a given function 
w = f(z) place in evidence a mutual relationship between the two com- 
plex quantities w and z in the sense that either one may apparently be 
looked upon as the independent variable. In other words, the given 
function 


w = f(z) = u(x,y) + jo(a,y) | — [2] 
may presumably be inverted to yield 
z= $(w) = x(u,v) + jy(u,r) [22] 


at least over regions throughout which a one-to-one relationship exists 
between w and z. This thought may be investigated further through the 
consideration of the relations 


ox ox 
Ds 29 ay 
dy = oi du + a dv [24] 


which are the inverse of Eqs. 7 and 8. 
Denoting the determinant of Eqs. 7 and 8 by D, and noting Eqs. 12 
and 13, one sees that 


ou dv du dv ou 
Deg pee) Pg) at) 
But, again with the help of Eqs. 12 and 13, one has* 
aw , ou dv du Ou ov . Ov 
ea @)=Etiigtaytigg — Bel 
so that ae 25 yields 
D = {f'(z)|? [27] 


Now Eggs. 7 and 8, on the one hand, and the pair of inverse Eqs. 23 
and 24, on the other, must have inverse matrices; that is, 


ax ax) [ou aut 

ou ov] | dx oy 

ay ay|~] a0 av a 
ou av ox ay 


*The correctness of these relations should be clear from the fact that the value of the 
derivative is independent of the angle of the increment dz = dx + j dy. If this angle is zero 
then dy = 0, which means that # and v are differentiated partially with respect to x only. 
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from which it follows that 


ede oF Dey [29] 
Hence 
oe oY [32] 


which are the Cauchy-Riemann equations pertaining to the inverse 
function 22. 
With the help of these various relations, one may now write 


av, ov 
dz _ ax ay _ dy Joe _ 
dw ou '’au D OU 
dy an 
Pee Se ee ee a 
~ ou, ,av fi(s) \dz [33] 
ant! ax 


According to Eqs. 31 and 32, the inverse function.22 is also a function 
of a complex variable, and Eq. 33 shows that the derivative of the inverse 
function is the reciprocal of that for the given function at corresponding 
values of w and z. In other words, the conformal maps for the function 
w = f(z) may likewise be regarded as the maps for the inverse function 
z= o(w). 

A precautionary remark may be made here about difficulties of inter- 
pretation in dealing with multivalued functions. Although these matters 
are discussed in greater detail in subsequent articles it should be observed 
now that, in view of Eq. 33, the derivative of the inverse function evi- 
dently does not exist at a point where that of the given function w = f(z) 
becomes zero. In the immediate vicinity of such points, the maps in the 
w- and z-planes for a given function are still uniquely related, although the 
preservation of angular relationships no longer holds (as is further dis- 
cussed in Art. 14). 
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4. THE z-PLANE AND ITS ASSOCIATED COMPLEX SPHERE; THE POINT 
AT INFINITY 


At various times it is expedient to consider the value or behavior of a 
function at infinity, that is, in the limit z—> . Since any point in the 
complex z-plane which is infinitely remote from the origin is a point at 
infinity, it may seem as though infinity should be regarded as a vast 
region embracing an infinity of points. While admissible on purely logical 
grounds, this view is extremely awkward from a mathematical standpoint, 
since the behavior of a function “at infinity”’ would embrace its behavior 
at an infinite number of points. 

The difficulty involved here is readily overcome, however, through 
introducing (by definition) a slightly altered conception of the complex 
plane. Thus it is perfectly admissible to think of this plane as being the 
surface of a sphere of infinite radius, or, for the sake of easing the mental 
strain produced by this conception, as a sphere with so large a radius that 
any finite region that may be considered appears for all practical purposes 
to be flat. If the origin of this ‘‘ plane ” is taken to be the south pole of 
the sphere, all points infinitely remote from the origin coincide at the north 
pole. Infinity is then no longer a vast region but becomes a single point. 
It is called the point at infinity. 

In order to overcome the necessity of thinking of the z-plane as an 
enormous spherical surface, another artifice may be utilized which in some 
respects has certain advantages over the infinite sphere idea. The z-plane is 
visualized as a truly flat surface, with a sphere of arbitrary but finite 
radius resting upon its origin. The point of tangency between the sphere 
and the z-plane at its origin may be taken as the south pole of the sphere. 
The corresponding north pole is then perpendicularly above the origin. 

A given finite point zo in the z-plane is now thought of as joined with the 
north pole of the sphere by a straight line, which intersects the surface of 
the sphere at one point other than the north pole. This geometrical con- 
struction, which is called stereographic projection, associates a point on 
the sphere with every point in the complex z-plane in a manner unique 
for all points except those infinitely remote from the origin. All these 
correspond to the north pole of the sphere. Thus, with the help of stereo- 
graphic projection, infinity is again interpreted as a single point. 

In all considerations regarding regions and paths in the complex plane, 
the corresponding ones on the surface of the sphere may be substituted. 
It can be shown geometrically that the process of stereographic pro- 
jection preserves the magnitudes of angular relationships between inter- 
secting curves in the plane and the corresponding ones on the spherical 
surface (the process is isogonal). Circles in the plane are circles on the 
sphere. In particular, any circle on the sphere which passes through the 
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north pole becomes a straight line (circle with infinite radius) in the 
plane. A great circle through the north pole (meridian) corresponds to a 
straight line drawn through the origin (radius vector) in the z-plane. 
Such a sphere is referred to as the complex sphere associated with the 
z-plane. A similar sphere may evidently be associated with the w-plane 
in connection with the mapping of any function w = f(z). Because of 
the isogonality of the process of stereographic projection, it follows that 
corresponding maps on the two spheres for a given function are conformal, 
so that the spherical surfaces may in all instances be used to replace the 
complex w- and z-planes. In this way the pro¢ess of conformal mapping 
may readily be visualized over regions which include the point at infinity. 


5. ALTERNATIVE GRAPHICAL AND PHYSICAL INTERPRETATIONS 


A number of additional interesting properties of functions of a com- 
plex variable may be studied through identifying the complex plane with 
a cross-sectional plane associated with a physical system having longi- 
tudinal uniformity. This direction coincides with what is ordinarily 
designated as the z-axis of a rectangular co-ordinate system. A static 
field (electric, magnetic, or hydrodynamic) associated with such a 
supposed physical system has either a zero or a constant component in 
the longitudinal direction. This component is ignored. In the following 
discussion, and wherever the physical argument requires three-dimen- 
sional or space consideration, it is understood that a unit length in the 
longitudinal direction is implied. In other words, the field is regarded as 
a two-dimensional one, since its behavior is of interest only in the cross- 
sectional plane which is identified with the complex x,y-plane. 

In such a system, a vector function A (x,y) is assumed to represent 
the flow of some physical or fictitious fluid. In complex form 


A=A,+jA, [34] 
With reference to Fig. 1, S represents a closed (mathematical) bound- 


ary. If a differential length of this boundary is denoted by-ds, the net flow 
outward through the boundary is given by the integral 


¢ An ds [35] 


in which A, is the normal component (directed outward) of A. According 
to the geometry shown in the figure, and on the assumption that the 
integration is extended around the closed contour in the counterclock- 
wise direction, this integral may be written 


f Ands = g (Ade Avady) [36] 
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From Gauss’s law, it is recog- 
nized that this net flow outward may 
alternatively be calculated by integra- 
ting the divergence of A over the 
surface* enclosed by the boundary S, 
that is 


An is fai da [37] 


enclosed surface 
But 
‘Fic. 1. Relevant to the integral of the 
normal component of a vector function div A = dA, oAy [38] 
around a closed contour. Ox oy 
and hence Eqs. 36 and 37 yield 
0A, | OA, 
§ (Aedy ~ Ayde) = S( = + ne) aa [39] 


enclosed surface 


If A, and A, are now identified respectively with the imaginary and 
the real parts of a function of a complex variable w = f(z), thent 


Az = 0(«,y) [40] 
A, = u(x) [41] 

and the Cauchy-Riemann Eq. 13 shows that 
div A = 0 [42] 


and hence that 
§ (Ae dy — Ay dx) = § (ody — ude) =0 [43] 


In order that the divergence of A with components defined by Eggs. 
40 and 41 shall be zero throughout the surface enclosed by the boundary 
S, it is necessary, of course, that the Cauchy-Riemann equations hold 
throughout this region. This condition requires that the function w = 
f(z) be regular throughout the region; otherwise its derivative does not 
exist at all points over which the surface integral in Eq. 37 or Eq. 39 
extends. 

Physically, Eq. 43 means that the field A is source-free throughout 


*The reader may again be reminded that a unit of length in the longitudinal direction is 
implied so that this surface integration is actually equivalent to a volume integration. 

fIt should be observed that w = A, + jA, and hence that w= f(z) should not be con- 
fused with the vector function A, Eq. 34. 
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the enclosed region, as evidenced by the vanishing of the divergence of A 
for all points within the region. Hence it may be said that the imaginary 
and real parts respectively of a function of a complex variable which is 
regular throughout a given region in a z-plane may there be regarded 
as the x- and y-components of a source-free field. 

If now the line integral of the vector function A is formed for the 
closed boundary shown in Fig. 1, with a counterclockwise direction of 
traversal, one has 


$ A-ds= § (A.dz + Aydy) [44] 
According to Stokes’s law, it is recalled that 
pA-ds= f (curl A)-da [45] 

e enclosed surface 


The curl of A, which is directed normal to the x,y-plane, is given by - 
what would normally be regarded as the z-component, that is, 


0A, dA; 
curl, A = re ay [46] 
Equations 44 and 45, therefore, yield 
_ (aA, aA ‘) 
f ade + Ay dy) = (4 Se) da [47] 
enclosed surface 


Again identifying A, and A, with v and 4, respectively, according to 

Eqs. 40 and 41, and making use of the Cauchy-Riemann Eg. 12, one 

finds that . 
curl A = 0 [48] 


If the function w = f(z) is regular throughout the region enclosed by the 
curve S, Eq. 48 holds for all points within this region, so that 


f (A, dx + Ay dy) = $ waz + udy) =0 [49] 


Hence it may be said that the imaginary and real parts respectively of a 
function of a complex variable which is regular throughout a given region 
in the z-plane may there be regarded as the x- and y-components of a 
field which is not only source-free but also nonturbulent. If the enclosed 
region contains points at which the derivative of the function w = f(z) 
does not exist, the relations 43 and 49 no longer hold. In view of the 
present discussion such singular points may be regarded as either sources 
or vortexes in which the origin of the field A, and hence that of the 
function w = f(z), resides. This interpretation makes it clear that unless 
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the function w has singular points somewhere in the z-plane, it must 
reduce to a constant or to zero. In other words, the singularities of a 
function are its “life-giving ”’ elements, and out of their nature and 
distribution alone does a function derive its individual properties and 
characteristics. This view leads to a useful method of classifying functions 
purely in terms of the nature and distribution of their singularities, which 
is briefly discussed later on. 

The Cauchy-Riemann Eqs. 12 and 13 provide a further physical 
interpretation for the real and imaginary parts of a function of a complex 
variable. Thus if Eq. 12 is differentiated partially with respect to « and 
Eq. 13 with respect to y, the subsequent addition of the two equations 
yields 


ox? ay? 
On the other hand, if Eq. 12 is differentiated partially with respect to 


y and Eq. 13 with respect to x, the subsequent subtraction of the two 
equations gives 


=0 [50] 


+—5=0 [51] 


These results are recognized to have the form of Laplace’s equation for 
the potential of a two-dimensional source-free field. Hence the real and 
imaginary parts of a function of a complex variable may be interpreted 
as scalar potential functions. As such they may be assumed to determine 
a pair of nonturbulent vector field functions. If these are denoted by A 
and B respectively, one may write 


Ou Ou , 
A; = ox. A, = ay [52] 
and 
ov ov 


ay [53] 


Because of the Cauchy-Riemann equations, these field components are 
related as expressed by | 


A; a By, A, = —B, 2 [54] 
and hence the scalar product of A and B vanishes; thus 
A-B= A,B, + A,B, = —A,A,y + ArA, = 90 [55] 


In other words, the two nonturbulent fields defined by the potential 
functions 4 and v are orthogonal to each other.. 
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Now it is further recalled (from the study of vector analysis) that the 
system of equipotential lines defined by the equations « = constant, and 
the flow lines for field A form orthogonal families of curves. The same is 
true of the equipotential lines defined by the equations v = constant and 
the flow lines for field B. Since the flow lines for field B are orthogonal to 
those for field A, it follows, therefore, that the equipotential lines defined 
by «= constant are orthogonal to those defined by »v = constant. 
Hence the latter coincide with the flow lines for field A, and the former 
coincide with the flow lines for field B. This situation forms an alternative 
basis for the graphical representation of a function of a complex variable 
and for its physical interpretation. 

Thus, instead of using the conformal maps in the w- and 2-planes, one 
may study a given function of a complex variable graphically by plotting, 
in the z-plane alone, the systems of mutually orthogonal curves defined 
by the equations wu = constant and v = constant. Throughout regions in 
which the given function w = f(z) is regular, these have the character of 
the equipotential lines and flow lines of a source-free, nonturbulent field. 
Singular points again have the character of sources or vortexes, the nature 
and distribution of which determine the properties of the given function 
w. Some aspects of these physical interpretations are discussed further in 
Art. 22. 


6. INTEGRATION; THE CAUCHY INTEGRAL LAW 


A certain orientation with regard to the question of differentiation 
having been gained, attention may now be directed toward the interpreta- 
tion of the integral of a function of a complex variable. Here the two- 
dimensional character of the independent variable again injects some 
novel considerations at the outset. Thus if the integral 


[iow [56] 


is formally regarded as representing the integral of a given function 
w = f(z) between two particular values 2; and z, of the independent 
variable z, the question of the choice to be made for the continuous 
sequence of values of z as it proceeds from the point z, to the point z2 
immediately arises. Such a continuous sequence of values evidently 
defines some path or curve joining the points 2; and 22 in the complex 
z-plane. Since any number of paths may evidently be chosen in the 
detailed process of evaluating the integral 56, the possibility exists that 
the value of this integral may not be unique. 

This question is similar to that arising in connection with the discussion 
of the differentiation of complex functions, and it is again due to the two- 
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dimensional character of the independent variable. Again it is felt to be 
highly desirable that the value of the integral 56 should be unique, and 
if possible, that the Cauchy-Riemann equations, which insure the 
uniqueness of the derivative, should be sufficient to insure also the unique- 
ness of the integral 56 so that its value 
may be independent of the path of in- 
tegration without need for the imposi- 
tion of further conditions upon the 
function w. 

In order to investigate this ques- 
tion it is expedient to consider the 
integral formed for a closed contour 
in the zplane, as shown in Fig. 2. 
This so-called contour integral is writ- 


J 


Fic. 2. Region of analyticity in ten 
the discussion of Cauchy’s integral 
law. ¢ f(s) dz 57] 


Here the contour C is assumed to be traversed in the counterclockwise 
direction, whence the enclosed region G is observed to lie on the left. 
Within the region G the function w = f(z) is assumed to be regular. For 
all points within the region G, therefore, the function is differentiable, 
and the Cauchy-Riemann equations are fulfilled. 

By substitution from Eqs. 2 and 3, the integral 57 becomes 


fe) de = § (u + jo) (dz + jay) 


= § wax —vdy) +i § (ode + way) [58] 


The closed contour C in Fig. 2 and the conventions regarding the direc- 
tion of traversal are essentially the same as those shown in Fig. 1 for 
the closed boundary 5; and since the function w = f(z) is assumed to be 
differentiable at all points within the enclosed region, the results ex- 
pressed by Eqs. 43 and 49 apply. Hence the important result follows that 


$ fl@) ds = 0 "> [59] 


Since the points z, and z, appearing in the integral 56 may be thought 
of as any two points on the contour C, as indicated in Fig. 2, it follows 
from Eq. 59 that the integral 56 is independent of the path of integration 
or that a given path joining 2; with zg may be changed at will without 
altering the value of the integral 56 so long as the path is not moved 
across a point at which the function w is singular. The latter restriction is 
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readily appreciated through noting that if the two portions of the contour 
C joining 2; and 22 in Fig. 2 are regarded as two variations of a given path 
between these points, the statement that the enclosed region G contains 
no singularities is seen to be equivalent to stating that none are en- 
countered in the process of sweeping one of these paths across the region 
G into coincidence with the other. 

The result 59 is known as the Cauchy integral law, which states in 
effect that the integral of a function of a complex variable between two 
given points in the complex z-plane has a unique value (subject, of 
course, to the restriction that any two chosen paths enclose a region in 
which the function is regular). The nec- 
essary conditions to insure this result are 
expressed by the Cauchy-Riemann equa- 
tions which at the same time insure the 
uniqueness of the derivative of. a given 
complex function. 

This result is so important for all the 
subsequent discussions that it is well to 
consider in another way the relations 
leading to it. Figure 3 shows a differen- 
tial rectangle in the s-plane with its iis. Aa Senet 
center located at some point z. The mid- path’ ins the: cansideetion “ol 
points of the sides of this rectangle are Cauchy’s integral law. 
denoted by a, 6, c, d. A given function 
has the value w = f(z) at the point z and is there assumed to be regular. 
Except for differentials of higher order, its values at the points a, b, c, d 
are given by 


fa) =w+ 2% [60] 
f(b) =wt i [61] 
fe) =w- LE [62] 
fd) =w- w- Tie [63] 


Now since the function w satisfies the Cauchy-Riemann condition 
equations, its derivative is independent of the direction of the increment 
in the g-plane, so that 


af of df 64] 
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and hence 
fe)=wt Ee [65] 
fo) =wt at 2 [66] 
fe =»- 22 [67] 
fd) =» - G78 [68] 


The integral of the function w formed (in the counterclockwise direction) 
for the differential rectangle is given by the summation 


fife) ds = flay dy — f(b) dx —fQjdy + fd) dx [69] 


y 


HO ee 


Fic, 4. Approximation of a contour by a rectangular step curve. 
Substituting from Eqs. 65 to 68 shows that 
fire) dz =0 [70] 


As illustrated crudely in Fig. 4, a given closed contour C of finite size 
may be thought of as approximated by a rectangular step curve. The 
approximation becomes better and better as the size of the steps is made 
smaller and smaller. The integral around the closed contour C may be 
replaced by the sum of integrals around all the enclosed small rectangles 
(all taken in the counterclockwise direction) because the contributions 
from the sides of adjacent internal rectangles cancel, just as in the 
argument leading to the result known as Stokes’s law in vector analysis. 
If the function w = f(z) is differentiable at all points within the region 
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G, the result 70 is applicable to all the enclosed small rectangles, and thus 
the result 59 is again established. The Cauchy-Riemann equations. to- 
gether with the condition that w be differentiable at all points within the 


Fic. 6. A multiply connected region. 


enclosed region are again found to be necessary and sufficient for the 
validity of Eq. 59. 

The following remarks about the characteristics of the region enclosed 
by the contour C are necessary. This contour may conceivably have a 
form like that shown in Fig. 5, for which the enclosed region G is the 
shaded area. If the portions of the contour C leading to and from the 
smaller islandlike regions G,, G2, and G3 within G are moved closer and 
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closer together until they finally become superimposed, the contour 
integral around C evidently becomes equivalent to the sum of four 
separate integrals evaluated respectively for the contours C’, C1, C2, and 
C3 with directions of traversal as indicated in Fig. 6. 

So long as w = f(z) is regular within the shaded region G, the integral 
law 59 still holds when evaluated for the contour C of Fig. 5 or for the 
equivalent set of contours C’, C1, Ce, and C3 shown in Fig. 6. It may no 
longer hold, however, if applied only to the contour C’ of Fig. 6. Actually 
the enclosed regions G;, Gz, and Gz may contain points at which the 
function w = f(z) is not differentiable and therefore, the integrals per- 
taining to the separate contours C}, Ce, and Cz are not necessarily zero. 

A region which has embedded in it one or more subregions Gi, Ge,+* 
like the region G of Fig. 6 is said to be multiply connected. It is doubly 
connected if it contains one subregion, triply connected if it contains two 
subregions, etc. Unless the contour C in the integral 59 is interpreted in 
the manner illustrated in Figs. 5 and 6, the validity of Cauchy’s integral 
law evidently requires that the region enclosed by the contour be simply 
connected. 


7. CAUCHY’S INTEGRAL FORMULA 


A given function is assumed to be regular within the region enclosed 
by the contour S shown in Fig. 7. It is also assumed to be regular at all 
points on the boundary S. These 
points are denoted by ¢, and the 
corresponding values of the func- 
tion are expressed by 


w = f(s) [71] 


A contour integral is now con- 
sidered which has the form 


f(g) df 
ges [72] 


Here z denotes some point within 
Fic. 7. The region of analyticity in the the enclosed region. If ¢ is for 
derivation of Cauchy’s integral formula is the moment regarded as capable 
the boundary S a hie region enclosed of assuming any value within the 

ae: enclosed region as well as those on _ 
its boundary, it is observed that the integrand as a function of ¢ is reg- 
ular at all such points, with the exception of the point ¢ = z. At this 
point the integrand becomes infinite, and therefore Cauchy’s integral 
law no longer applies; the value of the integral 72 is then not neces- 
sarily zero. 
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However, if the point z is surrounded by a circular contour C, and the 
integration is extended over a closed path which consists of the curve S 
and the circle C joined by a line which is traversed in both directions, as 
shown in Fig. 8, the result is evidently zero because the corresponding 
enclosed region is the shaded area -where the integrand is regular. It 
follows, therefore, that the value of the integral 72 is equal to that of 


f(g) dg 7 
fb. (¢ — 2) [73] 


Fic. 8. The shaded region within which Bits) is regular. 


(o) 
f-2 


in which the circular contour C is traversed in the counterclockwise 
direction, and ¢ now refers to points on this circular contour. The correct- 
ness of this statement is recognized from the fact that the difference 
between the integrals 72. and 73 is the integral around the composite 

boundary of Fig. 8, which has the value zero. 
The radius p of the circle C about z is now to be thought of as being very 
small, so small in fact that throughout the process of carrying out the 
integration of 73 the value of f(¢) differs from that of f(z)by a negligible 
amount. In view of the fact that {(¢) is a continuous function within the 
region bounded by S, this is a permissible assumption. (It would obviously 
not be permissible if f($) were discontinuous at the point z.) Now if f(¢) 
in the integral 73 is replaced by f(z), it may be taken outside the integral 
sign, because the integration is evaluated with respect to the variable ¢. 

Thus one has 

fo) a 7 f(s) df = f(z) dg 
s(¢—z) de (t—2) e(¢ — 32) 

With reference to Fig. 7, the polar form of (¢ — z) is 


(¢ ~— 2) = pe’? [75] 


[74] 
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and since ¢ is constrained to lie on the circle C, 


dg = jpe’® do = 7(¢ — 2) do [76] 
so that 
qt 
Gay 7% WH 
Hence 
dt fg. 
fea meas ue) 
and consequently Eq. 74 yields 
f(g) ds ch 
= 2 
by 7 aif) [79] 
or 
1 gp fa 


This result is known as Cauchy’s integral formula. It enables one to cal- 
culate the value of a function of a complex variable at a point within a 
region in terms of its values on the boundary, provided the function is 
known to be analytic throughout the region inclusive of the boundary. 


8. THE EXISTENCE OF DERIVATIVES OF ANY ORDER 


By means of Cauchy’s integral formula, Eq. 80, it is possible to show 
that a function of a complex variable possesses derivatives of any order 
at a point where the function is regular. The familiar formula for the 


derivative of a function w = f(z) reads 


dw... [fe + ds) =£0) 
dz tim | Az [81] 
According to the formula 80, 
1 d 
f(z + Az) get aes _ fod [82] 


2aj Js §-— (3 + dz) 
Substitution of Eqs. 80 and 82 into Eq. 81 gives 


de _ mit | f LO 1 - a] 
oo es = Yi eG ay [83] 


Now 
j 1 Az 


t—(@+Az) ¢—-8 (F—2)? — Az (f—2) [84] 
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so that the limit 83 yields 


dw 1 ¢ifeas 


de Inj Js (F ~ 8) [85] 


The second derivative, according to the form of Eq. 81, is expressed by 


fw, [se + Az) =F) 
ae wm [AOS [86] 
in which the prime denotes the first derivative. By Eq. 85 
1 f(s) dg 


f'@ + Az) "adele Gear [87] 


and Eq. 86 therefore gives 
d? 


eer ar 9g C2) ee a 
Ge ~ limit | 5 S Ag preter ea] 188] 


But 
1 — 1 _ ae = 2) Az + Ae? [89] 
[s-@t+ As)? (¢-—8)? (— 2 — Az)?(¢ — 2)? 
so that the limit 88 is found to yield 
2, 
Pw 2 ¢ fog [90] 


dz? Axj Js (¢ — 2)8 


Continuing in this way, one establishes the following formula for the 
nth derivative: 


aw _ mt ¢ fede 
dz” _ rau (ee g)rtt [91] 


It may be concluded that the function w = f(z), if regular within a given 
region, possesses derivatives of any order for all points of this region, and 
these derivatives may be obtained through differentiating under the 
integral sign in the Cauchy integral formula 80. The existence of but a 
single derivative thus implies the existence of all subsequent derivatives! 
The elegance of this remarkable result can hardly be overemphasized, 
nor is it out of place to call attention to the fact that a similar result does 
not apply to real functions. 

The values of the derivatives of higher order are seen to be unique if 
this is true for the value of the first derivative of a given function. The 
first and all higher order derivatives of a function of a complex variable 
are themselves functions of a complex variable, and these derivative func- 
tions are analytic at any point where the given function is analytic. 
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9. PoINT SETS AND INFINITE SERIES 


Preliminary to the discussion of the expansion of functions in infinite 
series, which is given in the following article, it is helpful to point out a 
few fundamental principles regarding the convergence of such series. 
This subject may best be approached through considering an infinite set 
of points : 


Si, Sa, S3,7¢* Skye [92] 


Visualize the quantities 51, s2, etc., as points in the complex plane. They 
may be scattered about completely at random, or the sequence of their 
values may exhibit a tendency to become confined within a more and 
more limited area or region as the index & (identifying individual values 
in the sequence) becomes larger and larger. In the latter case the sequence 
is said to approach a limit and the set of points is said to possess a limit 
point. 

If one considers a number of concentric circles (with finite, nonzero 
radii) drawn about the limit point S in the complex plane in which the 
points 51, sg,.-+- are indicated by dots, each of these circles, however 
small, contains an infinite number of points. This statement is the defini- 
tion of a limit point. Considering a particular circle of radius €¢, it is possi- 
ble to state that some integer exists such that all points s, for k > n 
lie inside this circle. For any given radius e, the appropriate value of n 
must be sufficiently large to assure that none of the points in the unending 
SEqueNCE Sn41, Snt2, Sn43 °° lie outside the circle (although some s; for 
k < nmay still be inside); or, for a given integer 7, the appropriate radius 
e must be sufficiently large. As the value of 1 is chosen larger and larger 
the appropriate € may be chosen smaller and smaller since the points be- 
come denser and denser in the more immediate vicinity of the limit 
point. The latter, for this reason, is sometimes also referred to as a 
cluster point or as a point of condensation. 

It thus becomes clear that a limit S of the unending sequence sj, sz, 53, 
_ *++emay be said to exist if 


|S — | <¢, foralk>n [93] 


in which ¢ is nonzero but may be chosen arbitrarily small, and » is a 
finite integer depending upon e. If this condition is fulfilled, the sequence 
is said to converge to the limit S. 

) An important result known as the Bolzano-Weierstrass theorem follows 
‘from the definition of a limit point. This theorem states that if an unend- 
ing sequence of points 51, 52, 53, - - - is confined within a finite region, that 
region must contain at least one limit point. The truth of this statement 
may be seen through considering the region to be subdivided into a finite 
number of smaller ones, for example, squares. Since the number of points 


Art. 9 POINT SETS AND INFINITE SERIES 277 


is infinite and the number of squares finite, it is clear that at least one of 
the squares must contain an infinite number of points. This one may 
again be subdivided into smaller squares and the same reasoning repeated. 
Through continuing in this way one may state that within the original 
region it must be possible to find at least one nonzero but arbitrarily 
small subregion containing an infinite number of points, whence, accord- 
ing to the definition of a limit point, that subregion must contain one. 

The convergence condition 93 may be expressed in an alternate form, 
known as Cauchy’s principle of convergence, which reads 


ISntp — Sn| <€, fora finite ~ and all p = 1,2,--- © [94] 


According to this statement, a circle drawn about s, with the finite 
radius € contains all s, for k >; that is, it contains the unending 
sequence Sn41, Sn+2, Sn43, ***, and in view of the Bolzano-Weierstrass 
theorem it must, therefore, contain a limit point. 

Consider now the infinite series — 


S= ZX tn = ty + te + tg tees [95] 


and its periial sums 
SE = Uy + Ua + ug + +++ + Uy [96] 


For k = 1, 2, 3, --- these partial sums may be regarded as elements of 
the unending sequence 51, Sg, S3, -- + discussed above, and S as its limit. 
If this limit exists, the infinite series 95 is said to converge; if the limit 
does not exist, the series diverges. Cauchy’s principle of convergence 94 
may: be expressed in the form 


\tnpatUnret ; ++ +Unip|<e, for a finite m and all p =1,2,--- [97] 


The quantity |.S — s,| appearing in the condition 93 is the absolute value 
of the remainder of the series 95 after the £th term. 
Beside the series 95 it is significant to consider the one expressed by 


o = & [eal = lesa] + al + baal + +> [98] 


in which the terms are the absolute values of corresponding ones in 95. 
Since the remainder 


. [tnpa + Unpo + ngs +---| [99] 
of the series 95 is always smaller than the remainder 
lent] + feental + feemssl --- [100] 


of the series 98, it follows that 95 is surely convergent if 98 converges. 
In the latter event the series 95 is said to be absolutely convergent. 
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A sufficient test for the absolute convergence of the series 95 is ex- 
pressed by the condition 


limit |u,[1/" <1 [101] 


To prove this statement one should first observe that no matter how the 
limit of |~,|!/" is approached as 7 is allowed to become larger and larger, 
whether monotonically from above or from below or in an oscillatory 
fashion, the fact that the limit is less than unity enables one to state that 
there must always exist a finite value of m such that for k > 1, 
|u,|'/* < p < 1. Denoting the partial sums of the series 98 by on, one 
may write 


lontp—on|= letna| + |engel + : + |unsp| Sot +p"*?+- et prt? [102] 
or 


atl (1 —p”) 


rel 
\ontp—on|Se"* (1t+p+p?+: + --+p?*) =p <2 


(1—p) “1—p 
If one chooses p”+!/(1 — p) = €, then € can be made arbitrarily small 


through the choice of a sufficiently large m. Therefore the condition 101 
leads to the result 


lontyp — On| <€, forafinitewandallp=1,2,---« [104] 


[103] 


which is Cauchy’s condition 94 for the convergence of the series 98. 

If |u,|!/" approaches a limiting value p > 1 as m becomes larger and 
larger, the series 95 diverges since |\ttn| — ” > 1 for large x, in violation 
of the obvious necessary convergence condition 4, — 0 for” — o. 

Sometimes the test 101, for one reason or another, is not applicable and 
one must employ other means for examining the convergence of a series. 
A useful alternate method is the d’Alembert ratio test which is expressed 
in the statement: If in addition to |u,| > 0 for n > ©, one has 
UKLI 


<p<1foralk >n [105] 


the series S, Eq. 95, is absolutely convergent. 
To prove that this condition insures the fulfillment of the Cauchy 
condition 94, one may begin by observing that 


|nsi| < |enle 
Jenteal < |tnsile < |#n|o? 


[ents] < |etnsile? < [unlo® 
etc. [106] 
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Hence 


Jessa eal letal+ Hina +--+limtal <lealit athe ohoe) 
107 


or 


p- prt p ; 
lonsp - on < [ttn x dee < {tn Toy [108] 
Since |u,| — 0 for 7—» ~, one can always find a value of beyond 
which |#,|p/(1 — p) is equal to or less than an arbitrarily small nonzero e. 
Hence the Cauchy criterion is met. 

Another useful method is the so-called comparison test, according to 
which the given series 95 is compared with another series 


a Soe ea ee may [109] 


Thus if 5 is known to be absolutely convergent and |u,| < Clv,|, where 
C is any finite positive constant, the series S must be absolutely conver- 
gent. The series 5 is referred to as the dominant of S (or is said to dominate 
the series S). 

Trivial as this test may seem to most engineers, it is nevertheless very 
useful, particularly when both the preceding methods of testing for 
absolute convergence fail. An example is furnished by the series 


balge F 1 1 1 
<2 titgtg tet [110] 
Here : 
rin 
- [ent = () — 1 with 1 — for all finite r [111] 


This last result may be seen by observing that 


In Gy" =f G In ‘) [112] 


Since the quantity in the parenthesis becomes zero for n — © the result 
111 follows at once. It is clear that the condition 101 is not met for any 
finite nonzero 7, and yet one cannot say with certainty that the series 
diverges, since inequality 101 is a sufficient, but not necessary, condition 
for convergence. ° 

The criterion 105 for the d’Alembert ratio test likewise leads to an 
indecisive result, namely, one has 


Unt 
‘thn 


= (+ :) —1 forn— [113] 
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The convergence of the series 110 may be investigated through first 
determining another series which dominates it. To this end one writes 


1 
a +z) +(3 teats rn) 


n=10 
(= + eae aes cee ees ate [114] 
8” 10° + 11" 12" 13” iat ist 
Now 
1 1 1 1 2 ie 
G+) a a ai 
1,11 1N CAL ire 
(Gteteth) <er cae, 
1 1 1 8 — 1—r)\3 
(Gtgtctig)<gr@” [115] 
and.so forth. It is clear, therefore, that the series 
S= EM Part Ayr Qt. — [it6] 


dominates the series 110 or 114. The right-hand side of Eq. 116 has the 
form of the power series 


S=Ustaitst etree [117] 
for which |1,|!/" = |z|, so that the condition 101 reads 
jz| <1 [118] 
With regard to the series 116 the corresponding condition leads to 
|a—"| <1 or r>1 [119] 


When this condition is fulfilled, the series 110 converges. 
For r = 1 this series reads 


Sol 
eee ae ag ee [120] 


and 


1 1 1 
Snip — Sul = ep who ae rear [121] 
For no finite x, however large, can this quantity be less than an arbi- 
trarily small (nonzero) ¢€ for all p = 1,2,--- ~, since for p > mits value 
approaches unity. Therefore the series is divergent for r = 1; and it is 
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certainly divergent for smaller values of r since the terms in the corre- 
sponding expression 121 then are all larger than they are for 7 = 1. 
Absolutely convergent series have the important fundamental property 
that their values are unaltered through a rearrangement of the terms. A 
convergent series that is not absolutely convergent is referred to as being 
conditionally convergent. The value of such a series can very definitely 
be altered through a rearrangement of its terms. In fact Riemann has 
shown that a conditionally convergent series can be made to have any 
finite value for its sum through an appropriate grouping of the terms. 
For example, the series 


1 n—l 
cs clr 1-$+3-a2t+¢-+: [122] 


-3o 
is evidently not ere convergent since the sum of the absolute 
values of its terms is the series 120 which is divergent. It is not difficult to 
see by inspection, however, that the series does approach a finite limit 
in an oscillatory fashion. This limit may be computed to any desired 
accuracy, taking the terms in their given order. One thus finds 
S = 0.69315 - - - which, by other means, may be shown to be the decimal 
fraction approximation to In 2. If the terms are now rearranged as follows 


S=t+$-44+4$4+4-E4+3+n-Gt+--- [123] 
and the limit is again computed, one finds 
S = 1.03972--- = 1.51n2 


When a given series is not absolutely convergent, its possible condi- 
tional convergence may be investigated in the following way. Let the 
series be written in the form 


S = ay AnVn [124] 


in which a, and v, are any two parts into which the typical term of the 
series is separable (one may need to make several trials at this step in 
order to find an appropriate separation). Defining the quantities 


91 =a 
So = a1 + ae 
33 = a + dg + a3 [125] 


one may rewrite the series 124 as 
S = S,(t1 ~ v2) + S22 — 0) + S33 —%) +-++ [126] 
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with the partial sums 

Sn = Sy (01 — 02) +82 (Ve— 23) + + +SEn—-1(On—1— Pn) HSM [127] 

Thus one finds | 
Sep — Sn| = |—Snte + SnlOn — Mai) +° °° 

+ Sn4p—t (Ontp—1 = Dice) + Sniptnp| 

<|S,| i |» | aE |Snl : lon fa Un ea 


+|Sntoal + [ento-a — Mntol + [Sato] - [ental 


[128] 


Now if 
|Sz| = lar + ae +ag-+++++a,| <A forallé [129] 
with A finite, 
ISntp — Sn] <Aflon] + |omtel + (lam — Ampa] + [enta — Mal 
+++ |Pte1 = Mntol)} [130] 


Next, if 

o = D len — 41] is convergent [131] 

i 

so that 
|ong-p— On| =|?2—Pngs|t+|Png1—mpolt +> +[empp1—Mmtol|<er [132] 
for a finite ~ and all p = 1, 2,--- «©, and if further 

lOn4p| < € for all p = 0,1, 2,--- [133] 
the condition 130 becomes 

Snip — Sn| < A(e + 2) =€ [134] 
for a finite 2 and all p = 1,2,--- ©. The series 124 is then convergent, 


and the conditions 129, 131, 133 constitute a test (known as the Dedekind 
test for conditional convergence) which may be used to reveal this fact. 
As an illustration the method may be applied to the series 122. Here 
one may let 
1 


= (-—1)= = 
a, = (-1)""7, My = [135] 


Then 5, = 1 or zero and hence remains bounded for all values of x 
(condition 129). Terms in the series 131 have the form 
i Us eens ae 

~ n(n + 1) 


lon — Oma] = [136] 


n n+l 
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This typical term is smaller than 1/n?, and since the series with 1/n? as 
its typical term is convergent, the comparison test reveals the convergence 
of the series 131 in the present example. Finally the condition 133 is 
evidently met since v, in 135 approaches zero monotonically with in- 
creasing 7. Thus the conditional convergence of the series 122 is proved. 

In the majority of problems in which infinite series appear, the terms 
of the series are functions of some independent variable, and its sum is 
likewise regarded as a function of this variable. For example, in the power 
series 117 the quantity z may be assumed to have any complex value. 
It does not necessarily follow, however, that the infinite series represents 
a function of this complex variable since values of the series are not 
related to values of the variable unless the series is convergent. 

If for all points of a given region in the z-plane the series is convergent, 
it is said to be uniformly convergent in this region, and the series may 
then be regarded as there representing a function of the complex variable 
zg. As the condition 118 shows, the power series 117 is convergent for all 
points lying inside the unit circle about the origin. This is its region of 
absolute and uniform convergence. If the power series is written 


S= Zone” = dy + a + az” + gz +--> [137] 
then the convergence condition 101 becomes 
limit |an|""*|z| <1 [138] 
or 
[z| [139] 


< limit Ja," 
no 


This series converges absolutely and uniformly within a circle about the 
origin having the radius 
1 


” limit lanl? !* [140] 
This circle is called the convergence circle (also circle of absolute convergence) 
of the power series 137. 

For points outside the convergence circle the series diverges, for there 
the inequality in the condition 138 is reversed. The series may still con- 
verge for some points on the circle, but for at least one such point the 
series must be divergent otherwise the radius of the convergence circle 
could be chosen larger than the value given by Eq. 140, and this conclu- 
sion is in conflict with the condition 138. 

The statements just made with regard to the power series are unaltered 
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if the variable z is replaced by (z — 29) except that the convergence circle 
is then centered at the point z = 2p. 

To say that an infinite series converges uniformly at some point z = 21, 
is equivalent to saying that the series converges not only at this point 
but also for all points within a circle about z, with a nonzero but arbi- 
trarily small radius. When this condition is met the term-by-term deriva- 
tive of the series yields a resultant convergent series that correctly 
represents the derivative of the function defined by the given series, and 
an analogous statement may be made with regard to term-by-term 
integration. Within a region of uniform convergence one may, therefore, 
carry out the differentiation or integration of a function defined by an 
infinite series through applying the identical process to each term and 
summing afterward. The resulting series, however, may or may not 
possess the same region of uniform convergence. 

Power series have the interesting property that their term-by-term 
derivative or integral yields resulting series with the same region of uni- 
form convergence. This fact may easily be proved. The series obtained 

_ through differentiation and integration of 137 are respectively 


w _ F n—1 ep. ott 
S 2 nus and S ZF +1? [141] 


The condition 101 applied to these series reads 


Qn 


1/ 
limit |na,|1/*|2[3-0/™ <1 and limit 5 "elton <1 [142] 
n> 0 no 


Since 
1/n 


=1 [143] 


limit |n|!/" = limit 


ne n> 0 


1 
+1 
(as may be seen through considering the logarithms of the expressions 
subjected to the limiting process n—>~ ) it becomes clear that the con- — 
ditions 142 coincide with the condition 138, and hence that the series 
141 have the same convergence circle as the series 137. 

These matters pertaining to uniform convergence of series may be 
illustrated through considering the straight-forward expansion of some 
simple algebraic functions. For example, if one divides (1 — z) into unity 
by continuing the process of long division, there results 

or iteteteadtte. [144] 
which is the power series 117. For values of z within the unit circle this 
series identically replaces the function 1/(1 — z), whereas for values of 
z outside this circle the series and the function 1/(1 — 2) have entirely 
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different values. Near the point z = 1 both the series and the function 
have values that increase without limit, whereas at the point z = —1, 
Eq. 144 yields 


gid ogni ttt [145] 


The series is evidently divergent at this point, for it fails to approach a 
limit. Hence there is no reason to become puzzled about this result since 
the function and the series may be identified only where the latter 
converges. 

However, in view of the fact that conditionally convergent series, 
through a, particular arrangement of their terms, may be made to yield 
any desired sum, one is led to inquire: whether a divergent series like the 
one in Eq. 145 may nevertheless be summable through the application 
of some special process. 

This question, in the past, interested a number of mathematicians — 
Cesaro, Hélder, Euler, Abel, Borel, etc. — and the summation procedures 
developed by them are connoted by the letters C, H, E, A, B, etc., 
respectively. Thus if, for example, a particular series is summable through 
applying the Cesaro procedure, one abbreviates this statement by saying 
that the series is C-summable. This particular process of summation is 
now discussed in somie detail. 

With reference to the series 95 and its partial sums 96, consider the 
sequence - 


Si SS = ty 


Sits. u 
ne ia a 
2 1 
Ss tuts = ty + Fla + 5 ts [146] 


s,-8F tt is (1 tu t (1 = Vg toes tt 
n Nn. n 


which is referred to as an arithmetic mean sequence of the first order (if 
this sequence is subjected to the same process the result is said to be of 
Second order, etc.). A limit of the sequence 146 exists so long as Sp 
remains finite for unlimited values of m. It is not necessary that the terms 
of the series 95 have the property u, — 0 for m —> ©, but, if they do, one is 
led to the result 


limit {us + (1 _ ;)u + (1 — ust free tr 7 | = limit (sn) [147] 


n—> 2 


That is, the new sequence then apparently has the same limit as the 
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sequence of partial sums s,, a conclusion that should, however, be ac- 
cepted with obvious reservations. 
For the series in Eq. 145 the partial sums are 
$, = 1,52 = 0,53 = 1,54 =0,--- [148] 
from which it is clear that 


limit (S,) = li 


no nN 


, [Sit seter > tSe| _ 1 
ae. le 


Thus the left-hand member of Eq. 145 is interpreted as the Cesaro sum 
of the divergent series on the right. This type of summation will be 
used in connection with Fourier series in the following chapter. . 


10. TAYLOR’Ss AND MACLAURIN’S SERIES 


The point of departure in the present argument is Eq. 79, which is 
repeated below: 
f (5) ds 
c (§ — 3) 
Here the contour integral is assumed to be evaluated for a circular path 
C with its center at some point zo. Writing 
1 1 


= 2njf(z) [150] 


SS 151 

g-—2 ($= 4%) — @ — 40) ae 
one obtains, by the process of long division, the series 
= _ 2 

1 1 Z— Zo Pe Zo) sin A [152] 


——— = —— 4+ — 5 

t-2 $-—% | (§— 4%)? (— 40)” 
which, according to the theory of infinite series discussed in the previous 
article, is known to be uniformly convergent for 


tame [153] 


£ — 20 


Since the center of the circular contour C lies at the point 29, and ¢ lies 
upon the circle, this condition for uniform convergence is fulfilled for all 
points z within the circle. Because of its uniform convergence, the 
series representation for 1/(¢ — z) may be substituted into the integral 
Eq. 150, and the integration carried out term by term. This process gives 


TY oe as a (Oa 
anif(e) = f. (F —%) > aI (¢ — 20)? 


+ E= ny f LOS +++ [154] 
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A series representation for the function w = f(z) is thus obtained which 
reads 


f(z) = ao + a4 (2 — 20) + a2(z — 2)? + a3(z ~ 2)? +--- [155] 
with 
lk fo) a 
= Gb a a [156] 


Equation 155 is recognized as Taylor’s series representation for the 
function f(z) in the vicinity of the point 2, and Eq. 156 yields the 
coefficients for this expansion. Since the validity of Eq. 150 requires 
that the function f(z) be regular throughout the region enclosed by the 
contour C, and the series 152 or 154 is uniformly convergent only for 
points within this circular contour (because of the condition 153), it 
follows that the Taylor series 155 converges uniformly only within a circle 
about zo whose contour reaches to the nearest singularity of the function 
w = f(z). 

It is useful to note that, by means of Eq. 91, the relation for the 
coefficients of the Taylor series expressed by Eq. 156 may be written in 
the alternate form 

1 (df 
a= n} dz” Z=2 [157] 
which is recognized to agree with the familiar form used for the expansion 
of functions of a real variable. The Taylor series is thus found to be avail- 
able in unaltered form to functions of a complex variable. The Maclaurin 
series evidently applies also to functions of a complex variable, since it 
is identical with the Taylor series for the special case zp = 0. 

If the Taylor series, Eq. 155, is differentiated term by term, it is 
observed, according to the formula 157, that the coefficients in the re- 
sulting series are those for the expansion of the function df/dz about the © 
point z = 29, and hence that this result represents the Taylor-series for 
the derivative of f(z). The truth of this statement may be shown through 
starting with Eq. 150, differentiating under the integral sign (as per- 
mitted according to the conclusions of Art. 8), and substituting the square 
of the series 152 for the quantity 1/(¢ — z)?, obtaining in place of Eq. 
155 a series for the function df/dz. - 

One may conclude from this reasoning that the term-by-term dif- 
ferentiation of the Taylor series is permitted and yields the Taylor series 
representation for the derivative of the given function. Moreover, the 
resulting series has the same region of convergence as that for the given 
function, as may be seen from the discussion in the previous article or 
from the fact (brought out in the closing sentence of Art. 8) that the 
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derivative of a function of a complex variable is again a function of a 
complex variable and possesses the same region of analyticity. From an 
obvious continuation of the same argument one sees that these conclusions 
apply to derivatives of any order. 

Collateral to these considerations is the fact that the Taylor expansion 
of a given function is unique. To demonstrate this fact, assume that a 
series representation having the form of Eq. 155 is given, and that this 
series is somehow known to represent the function f(z) within an arbi- 
trarily small region in the immediate vicinity of the point z = 2. The 
given series must be the Taylor expansion for f(z) about the point z = zo, 
because the behavior of the function in this vicinity completely de- 
termines its successive derivatives there. That is, differentiating the given 
series and substituting the results into Eq. 157 demonstrates that the 
coefficients are identical with those of a Taylor expansion about the same 


point. 


11. THE PRINCIPLE OF ANALYTIC CONTINUATION 


One of the most interesting things about the theory underlying the 
Taylor expansion is that the mere knowledge of a function within an 
arbitrarily small region in the vicinity of some point z completely de- 
termines that function at all other points within the convergence circle 
about that point. The purpose of the present article is to call attention 
to the even more remarkable fact that a function of a complex variable 
is determined throughout the entire z-plane* from a knowledge of its 
properties within an arbitrarily small region of analyticity. + 

The first step in the process of carrying out such a determination is to 
write down the Taylor series about the point z9 where the function and 
its successive derivatives are known. Some other point 2’) within the 
convergence circle for this Taylor series may then be chosen as the center 
about which a new Taylor series is determined. This choice can always 
be made, because the original series may be used for calculating the values 


*This statement is subject to some restriction when the function in question possesses 
what is known as a natural boundary,every point of which is a singularity. In such an example, 
either the function does not exist beyond this boundary or its behavior there is governed by 
an entirely separate definition, in which case it is perhaps more appropriate to say that one 
is dealing not with a single function but actually with two separate functions. In order for 
the reader to appreciate that these ideas are not merely of an academic nature, his attention 
is called to the fact that natural boundaries of the sort referred to here do occur in practical 
problems. For example, the functions representing the fields in the cross-sectional plane of 
a wave guide possess-as their natural boundary the walls enclosing these fields. 

jIt is not even necessary to know the function at ali points within an arbitrarily small 
region about 9; it is sufficient to know the values of the function, in the interior of the region, 
for all points of a finite but arbitrarily small line segment, or in a set of discrete points which 
have a limit point. 
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of the successive derivatives of the function in the point 2’. Unless every 
point on the original convergence circle is singular, the point z’» can always 
be so chosen that the new convergence circle about 2’ encloses a portion 
of the region lying outside the original circle. Since the two circles partially 
overlap, the two Taylor series have a certain convergence region in com- 
mon where the function is determined by either series. The second series, 
however, enables one to calculate values of the function at points within 
a circumscribed region lying beyond the boundaries which limit the 
representation of the function by means of the first series. One speaks of 
this process as an analytic continuation of the function into the newly 
acquired region. 

By properly choosing a third point 2’’y within this newly acquired 
region and using the second series for the calculation of the successive 
derivatives of the function in this point, one obtains a third Taylor 
series whose convergence circle encloses a portion of the z-plane which 
lies beyond the boundaries given by either of the first two convergence 
circles. The third Taylor series, therefore, represents a further continua- 
tion of the same function. 

In order to obtain such a further continuation it is, of course, not neces- 
sary that the third point 2’’9 be located outside the first convergence 
circle, since the selection of this point may be regarded merely as a re- 
vision in the choice of the second point 2’9. It should be easy to appreciate 
that this procedure may be continued in a variety of ways so as to obtain 
numerous overlapping convergence circles and corresponding series 
representations by means of which the function is ultimately determined 
within any desired region of the z-plane except for the singular points 
of the function. 

The process of carrying out an analytic continuation in this manner ; 
and of obtaining a succession of partially overlapping convergence circles 
which extend the known region for the function into a continuously 
expanding portion of the z-plane may be regarded as a process of succes- 
sively acquiring access to additional area in the z-plane after the fashion 
that a harvester, cutting grain with a scythe, successively acquires 
additional stubble ground by executing a continuous series of semi- 
circular slices. If for the moment one imagines this harvester to be doing 
a rather unsystematic job, it is conceivable that he may cut a swath or 
path which circles about a portion of the grain field and returns so as to 
overlap itself. If the process of analytic continuation is carried out in this 
manner, one expects, in the overlapping portion, to regain the same values 
of the function as were obtained previously. This, however, may or may 
not be the case, for the function may be multivalued, and in returning to 
the original portion of the z-plane one may find oneself located on a dif- 
ferent leaf of the Riemann surface* characterizing that multivalued 

*These matters are discussed in further detail in Arts. 17 and 18, 
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function. In fact it is possible to find, after returning to the original 
region, that a point at which the function was observed to be regular 
when this region was first encountered is now a singularity of the function. 

The significent matter here is that the process of analytic continuation 
is applicable whether the function is single-valued or not, and that even 
a multivalued function with all its manifold characteristics and peculiari- 
ties is completely determined from a knowledge of that function over an 
atbitrarily small region of analyticity on a single leaf of its Riemann 
surface. 

It follows also from these considerations that any two functions of a 
complex variable whose values coincide over an arbitrarily small region 
of analyticity, or for all points of a finite but arbitrarily small line segment, 
or for a set of discrete points having a limit point, must have identical 
values throughout their common region of analyticity and hence must 
there be identical. This statement is known as the identity theorem or 
uniqueness theorem for analytic functions. 

If the common region of analyticity possesses a natural boundary, 
nothing is implied regarding the behavior of the functions within other 
possible regions of analyticity. The theorem likewise does not: imply 
that the behaviors of the two functions are identical at isolated singulari- 
ties located within their common region of analyticity, but these matters 
are practically irrelevant inasmuch as a function is usable only where it 
is analytic, and no difficulty can arise from assuming (if this be desirable) 
that the functions are identical everywhere else. 

The arbitrarily small line segment over which the function is initially 
known may be a portion of the real or imaginary axis in the z-plane. A 
real function of a real variable may be regarded as a function of a com- 
plex variable whose independent and dependent values both happen to be 
on the real axis. If it is possible to continue such a function into the 
complex domain, that continuation is unique and is immediately obtained 
by the simple expedient of replacing the real independent variable x by 
the complex variable z = x + jy. The truth of this statement, expressing 
a property of functions known as their permanence of form, follows di- 
rectly from the identity theorem, since the given real function and its 
continuation obviously coincide for points on the real axis. 


12. SINGULAR POINTS AND THE LAURENT EXPANSION 


Since the singularities of a function of a complex variable are, so to 
speak, the mainsprings upon which its very existence depends, it is usually 
of chief interest in the study of a given function that characteristics of 
the function be investigated in the immediate vicinity of these singular 
points. For this purpose the Taylor series is of little service, because the 
vicinities of singular points are the very regions where its convergence 
fails. Consequently it is of considerable importance to search for a type 
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of expansion whose sphere of usefulness is centered about a singular point, 
that is, an expansion which places the character of a given singularity 
in evidence. 

A few preliminary remarks concerning singularities will be appropriate 
preceding the detailed discussion of how such an expansion is found. 
Quite generally one must admit the possibility that a given function may 
be singular (nondifferentiable or nonanalytic) not only at certain discrete 
points but also at all points comprising a finite region in the z-plane. 
If the latter is the case, and a particular point within this region is 
singled out, it is not possible to discover an immediately surrounding or 
neighboring space, however small, in which the function is analytic. 
In other words, the singular points within such a region are infinitely 
dense. 

For the considerations of the present article, such singular regions 
must be ruled out. Indeed, it is not possible to discover any kind of series 
expansion which can represent a function in the vicinity of a point lo- 
cated within a region of this sort. From a practical point of view this is 
hardly discouraging, however, inasmuch as functions which possess such 
singular regions are seldom encountered in engineering work.* The present 
discussion, then, will.apply only to singularities for which it és possible 
to discover an immediately surrounding region in which no other singular 
points lie. These are called isolated singularities. 

With reference to Fig. 9, 29 represents a point at which a function 
w = f(z) has an isolated singularity. About this point as a center are 
drawn a small circle c and a larger one C. The given function may have 
other singularities within the smaller circle or outside the larger one, but 
it is assumed to be analytic and single-valued at all points such as the 
point 2 inside the annular space between the two circles. Cauchy’s in- 
tegral formula, Eq. 79, is, therefore, applicable to the composite contour 
consisting of the two circles and the path joining them, traversed con- 
tinuously as indicated in Fig. 9. Since the path joining the circles is 
traversed in both directions, the resulting form of Eq. 79 maybe written 


f(§) de as f(9) dg 
c (¢ — 2) e (f — 8) 


in which the larger circle is traversed in the counterclockwise direction, 
the smaller one is traversed in the clockwise direction, and ¢ refers to 
points on either of the two circles. 

For the integral over the large circle, the series 152 for 1/(¢ — 3) 


Qaif(z) = [158] 


*It may be pointed out in this connection that in terms of the analogy to physical flow 
fields (as discussed in Art. 5), the continuously distributed sources or vortexes (spatial distribu- 
tions of electric charge or current densities, for example) constitute just such singular regions. 
However, the solution of practical problems dealing with these situations fortunately does not 
require the series representation of functions within these regions. 
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converges uniformly because the condition 153 is fulfilled. For the integral 
over the small circle, this statement is no longer true, because the point 


oO 


Fic. 9. The composite contour to which Cauchy’s integral formula is applied 
in the derivation of the Laurent expansion. 

z lies outside the small circle and hence |z — zo| > |g — Zo|. Here Eq. 

151 must be replaced by 

1 -1 


fo87 @-m)- €-m) os 
whence a process of long division yields the series 
1 1  .@—%) — &-%)? 
t—-2.  #-% (%—%)* (@—-%)° os 
which is uniformly convergent for 
ae >1 [161] 


It thus becomes clear that for 1/(¢ — z) one may substitute the series 
152 into the first integral in Eq. 158 and the series 160 into the second 
integral, and carry out both integrations term by term. Equation 158 
then becomes 


d 
Bo say pasar IO) ds 
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2 f(s) dt 
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This result may be written in the form 
F(%) = bo + bi(@ — 2) + b2(z — 20)? + bg(z — 2p)? +--- 
+ b_1(@ — 29) + b_o(e — 2)? + bale — 2) FP +--+ [163] 
in which 
1g fa 


"=~ aideG —a) forn = 0, 1, 2, --- [164] 


and 


_-lg_f@a 
2ajJe (¢ — z9)”* 


Equation 163 is the desired series representation for the function w = F(z). 
It is called the Laurent series. From the derivation just given it is clear 
that the series converges uniformly within the annular region between 
the two circles shown in Fig. 9 where the function w is analytic. Thus the 
radius of the outer convergence circle extends from Zg to the nearest 
singularity beyond the circumference of the smaller circle, and the radius 
of the inner convergence circle extends from zo to the farthest singularity 
inside the larger circle. If the singularity at zo is the only one inside the 
larger circle, the radius of the smaller circle may become vanishingly 
small, and the Laurent series is then seen to represent the given function 
in the immediate vicinity of the singularity at zo, that is, to represent an 
expansion of w = f(z) about this singularity as a center. 

In connection with the formulas 164 and 165 for the coefficients of the 
Laurent series it should be observed that the direction of traversal about 
the large circle C in the integral 164 is counterclockwise, whereas that 
about the small circle c in the integral 165 is clockwise. If the latter direc- 
tion of traversal is reversed, the algebraic sign in Eq. 165 changes from 
minus to plus. The formulas 164 and 165 then differ only in that the first 
is evaluated for the circumference of the larger circle and the second is 
evaluated for the circumference of the smaller one. Since the function 
w = f(z) is analytic in the region between the two circles, the values of 
the integrals 164 and 165 are the same for any closed contour within the 
annular region or coincident with either circle. Hence the formulas 164 
and 165 may be replaced by a single one which reads 


_ 1 f(g) dg 
7 aa (f= %)*t 


form = —1, —2, —3,--- [165] 


n (166] 
in which S is any closed contour within the annular region or coincident 
with either circle, ¢ refers to points on this contour, and the latter is 
traversed in the counterclockwise direction. Since the function w = f(z) 
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is singular in the point zo, Eq. 91 does not apply for z = 2 and hence 
there exists no alternative differential formula for the coefficients bn, 
similar to Eq. 157 for the coefficients of the Taylor series. This circum- 
stance is a practical disadvantage because integration is, as a rule, more 
difficult than differentiation. Consequently, when a Laurent expansion is 
to be found, the coefficients are determined, wherever possible, by other 
expedients. Further discussion of this point is given later on. 

When the form of the Laurent series, Eq. 163 is contrasted with that 
of the Taylor series, Eq. 155, the principal difference is observed to lie in 
the fact that the Laurent series contains both descending as well as 
ascending powers of the variable (z — zo), whereas the Taylor series 
contains only the ascending powers. The portion of the Laurent series 
involving the ascending powers only is called the ascending part of the 
series, and the portion involving the descending powers is called the 
descending part or principal part. It is the principal part of the Laurent 
series gvhich places in evidence the singularity of the function w = f(z) 
at the point 20. 

The ascending and descending parts of the Laurent expansion may be 
written respectively as 


fils) = E bale — 20)" [167] 
and 
fale) = bale — to)” [168] 


whence Eq. 163 for the Laurent series becomes 
f@) =fi@) + fe) [169] 


Because the series 152 (which leads to the ascending part f;) converges 
uniformly for all z-values within the larger circle and the series 160 (which 
leads to the descending part fz) converges uniformly for all z-values out- 
side the smaller circle (Fig. 9), it follows that the series 167 converges 
everywhere within the larger circle whereas the series 168 converges 
everywhere outside the smaller one. Both series converge within the an- 
nular region between the two circles (this is the common portion of the 
two separate regions of convergence), and this, therefore, is the region of 
convergence for the sum of the two series 167 and 168, which is the 
Laurent expansion. 

The function f;(z) is analytic everywhere within the larger circle, and 
the series 167 is its Taylor expansion about the point z = Zo. The function 
fo(z) is analytic everywhere outside the smaller circle. To interpret the 
series 168 for f(z), it is helpful to consider for the moment a change of 
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variable which amounts to replacing (z — 20) by w = 1/(z — 2) which 
amounts to interchanging the roles of points zp and «. The series 168 
then becomes one involving only ascending powers of the new variable 
w and represents a Taylor expansion of the function fo(z) about the point 
w = 0 (which corresponds to a Taylor expansion about the point zg = 
according to the change of variable considered). It is helpful in this 
reasoning to think in terms of the complex sphere rather than the complex 
plane, for then the points z = z) and z = & are simply two points on the 
sphere and the interchange of the parts which they play is easier to 
visualize. One is thus led to recognize that the descending series 168 may 
be regarded as a Taylor expansion of the function f2(z) about the point 
Z= 0, 

According to this interpretation the Laurent series represents the given 
function by means of two Taylor series, one about the point z = Zp (this 
is the ascending series for f,) and one about the point z = © (this is the 
descending series for f,). The function f;(z) contains only those singu- 
larities of f(z) which lie outside the larger circle about 20; fo (z) contains 
only those singularities of f(z) which lie within the smaller circle. This 
circle may alternatively be regarded as one which is drawn about the 
Point z = « asa center, whence the region “ within ” the circle (actually 
the region outside the smaller circle) becomes the region of analyticity 
for the function f,(z). The change of variable indicated by (z — 2) > 
1/( — 20) evidently interchanges not only the points z = z) and z = 
but also the roles played by the functions f,(z) and f(z) and their series 
representations as given by Eqs. 167 and 168. 

From the uniqueness theorem discussed in the preceding article it fol- 
lows that the Laurent expansion is unique, since two series representations 
having identical regions of convergence and yielding identical values for 
all points within this region must be identical. This fact has practical 
significance, particularly with regard to the Laurent expansion, because 
it means that if a representation having the form of Eq. 163 is found for a 
given function in the vicinity of one of its singularities, this representation 
must be the Laurent series for that region irrespective of the method by 
which it is determined. In other words, the coefficients need not be 
calculated by means of the formula 166, but may be found by any other 
expedient which proves to be most effective in the given circumstances. 


13. KanDs OF SINGULARITIES AND THE CLASSIFICATION OF FUNCTIONS 
IN TERMS OF THEM 


If the smaller circle c of Fig. 9 encloses no other singularities except the 
isolated one at the point z = zo, the principal part (Eq. 168) of the 
Laurent series (Eq. 163) characterizes the nature of that singularity. If - 
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the principal part contains a finite number of terms, the singularity is 
referred to as a pole. The principal part may then be written 


A(z) = b-4(@ — 20)? + bale — 20)? +++ + be(@ — 20)" [170] 


in which the highest negative exponent s is called the multiplicity or order 
of the pole. A pole of the first order is also called a simple pole. 

The principal part #(z) may have an infinite number of terms, in which 
case the function f(z) is said to have an essential singularity at the point 
Z = Zo. At a pole the function f (z) becomes infinite, but in the vicinity 
of an essential singularity the function may assume any assigned value 
depending upon the manner in which this singularity is approached.* The 
function w = e’, for example, has an essential singularity at the point 
z = of, and the function w = el/*inz has essential singularities at the 
points z = 0, 7, 2x, ---. For the function 


f@ =é=etVo ed (osytjsny) =u tj [171] 


one may recognize this peculiarity by noting that 


e=VeP Te (172] 
and . 
v 
tany=7 [173] 


Consider an arbitrary choice of values « and v. It is then possible to allow 
z to approach infinity along a path, parallel to the y-axis, designated by 
x = In Vu? + v*, since this value of x satishes Eq. 172. Equation 173 
may also be satisfied along this path for an infinite number of values of 
which tend to infinity. Also, if z approaches infinity along the negative 
real axis, e” becomes zero, and if z approaches infinity along the positive 
real axis, e” becomes infinite. 

Not only is a pole a milder form of singularity, but the behavior of the 
function in its vicinity is a very definite one. For a pole of multiplicity s, 
multiplication of f(z) by the factor (¢ — Z)* yields a function which is 
regular in the point z = 29. This circumstance may be regarded as a test 
whereby an ordinary pole may be distinguished from any other kind of 
singularity. 

Other kinds of singularities, particularly those found in multivalued 
functions (branch points), are discussed in a subsequent article. In the 
meantime it is useful to point out how certain types of single-valued func- 

*Theorem of Casorati-Weierstrass. 

{This characteristic of the function e* may be recognized from the fact that the Taylor 
series, which in this case converges uniformly in the entire z-plane, has an infinite number of 
terms.- Note also the discussion of entire functions immediately following. 
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tions may be classified according to the poles or essential singularities 
which they possess. Once more it is emphasized that the singularities of 
a function are the mainsprings of its existence. Without singularities of 
any kind, an analytic function reduces to a constant. 

In this classification one may begin with that type of function which is 
singular only in the point at infinity. Such a function is regular in the en- 
tire finite z-plane. It is called an enéire function or also an integral func- 
tion, and it-may be denoted by J(z). There are two kinds of entire 
functions; for one of these the singularity at infinity is an ordinary pole, 
and for the other it is an essential singularity. The first of these functions 
is more particularly referred to as an entire rational; the second, as an 
entire transcendental function. 

Since the entire function is regular in the entire finite z-plane, it pos- 
sesses.a Taylor series representation for all finite points, which con- 
verges uniformly within the entire z-plane. If the function is transcen- 
dental, such a Taylor series contains an infinite number of terms. The 
functions sin z, cos z, and e* are common examples. An entire rational 
function, on the other hand, possesses a Taylor series representation 
having a finite numberof terms, the highest power of (z — zo) being equal 
to the order of the pole at infinity. The entire rational function, therefore, 
is an ordinary (finite) polynomial; that is, 


T(z) = P(z) = ao + a (2 — 2%) + a2(@ — 2)? +--+ + Gn(Z — 20)” 
[174] 


in which 7 is the order of the pole at infinity. For x = 0, the function is 
also regular at infinity, and in this case reduces to the constant dp. 

A second important class of functions are those referred to as mero- 
morphic. They may be defined as given by the ratio of two entire func- 
tions; thus 


Mee) =P [175] 


Since J2(z) is a finite or an infinite polynomial, it may become zero at a 
finite or at an infinite number of points in the z-plane. If I2(z) is thought 
of as factored in terms of its roots (these are called the zeros of I2), it 
becomes clear that, at these points, M(z) has ordinary poles whose 
orders equal the multiplicities of the roots of I2(z). The factored form of 
Iz(z) is its finite or infinite product representation, and this form places 
the poles of the function M(z) in evidence. 
At the point infinity the meromorphic function has an essential singu- 
larity if either or both of the entire functions J; and J, are transcendental. 
M(z) is then said to be transcendental also, but at no finite points in the 
z-plane can this function have singularities other than ordinary poles. 
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The function w = tan z is a common example of a meromorphic function 
which is also transcendental. It has an infinite number of simple poles in 
the z-plane and in addition has an essential singularity at infinity. 
When both J, and Jz are rational (given by finite polynomials), the 
resultant meromorphic function is also rational. At the point infinity, the 
function then has at most an ordinary pole. In other words, a mero- 
morphic function whose singularity at infinity is an ordinary pole, is a 
rational function. Rational functions, then, are such having mo other 
singularities except poles. Inasmuch as the representation of them is given 


by 


P(g) 
Re) => 176 
in which P, and P2 are finite polynomials, it follows that a rational func- 
tion has a finite number of poles. 


14. ZEROS AND SADDLE POINTS OR POINTS OF STAGNATION 


If in the Taylor series representation for the function w = f(z) as given 
by Eq. 155, the constant term dp is zero but a, is not zero, the function is 
said to have a simple zero in the point z = z9. For the immediate vicinity 
of this point, that is, for (g — zo) < 1, the function is approximately 
represented by the term a,(z — Zo) alone. The reciprocal function, 
1/f(z), has a simple pole in this same point, for its representation in this 
vicinity is approximately given by 1/ai(z — 20). 

If both do and a, are zero and dz is different from zero, the function is 
said to have a zero of the second order in the point z = 2. For the imme- 
diate vicinity of this point the function then is approximately represented 
by the term a2(z — Zo)”. The reciprocal function similarly is approxi- 
mately represented by 1/az(z — 20)” and has a pole of second order in 
this point. 

In general a zero is said to be of the order s if the reciprocal function has 
a pole of order s in the given point. This is the case if the first nonzero 
coefficient in the Taylor series expansion for the function is a,. According 
to the formula 157 for the Taylor coefficients, this condition results if the 
function and its first s — 1 derivatives all vanish at the point z = Zo. 

It is also possible that the first s — 1 derivatives of the function are zero 
at the point z = Zo but that the function itself is not zero there. That is to 
say, all the coefficients a;, d2, and so forth up to and including a,_; are 
zero, but dp is not zero. In this case the function f(z) — a, or w — a, 
has a zero of the order s in the point z = 2p, but the function w obviously 
is not zero there. Except for the additive constant ao, the behavior of the 
function w in this vicinity is, however, clearly the same as though this 
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point were an sth order zero. The terminology used to refer to such points 
must be so chosen, however, as to distinguish them from zeros. For reasons 
of physical interpretation, to be discussed in the following paragraphs, 
they are referred to as saddle points or also as points of siagnation.* 

For the immediate vicinity of a saddle point of the order s — 1, one 
hasf 


w=uUutjv& do + as(% — 20)* [177] 


The point is a zero of order s if ag = 0. The following detailed discussion, 
in which the constant dp is dropped, applies to either zeros or saddle 
points. It is convenient to write 


(2 — zo) = rei? [178] 
whence 
w= ut jv & a,re®? = ar (cos sd +7 sin sd) [179] 
Equating real and imaginary parts gives 
u & a,r* COs S@ . [180] 
v & a,r' sin sd [181] 


For the graphical representation, in the z-plane, of the loci for 
uw = constant and v = constant, according to the discussion given in 
Art. 5, the relations 180 and 181 are helpful in showing the character of . 
such loci in the vicinity of the point zo. 

These loci are shown in Fig. 10 for the cases s = 1,5 = 2, ands = 3. 
The corresponding sketches in Fig. 11 show how the algebraic signs of the 
quantities u and v change in the vicinity of the point z = 2. The pictures 
in Fig. 10 may be regarded as depicting the direction of the gradient 
(wu = constant) and the lines of constant altitude (v = constant) in a 
mountainous terrain. The picture for s = 2, for example, is then seen to 
represent the vicinity of a point which is simultaneously the top of a 
ridge and the bottom of a valley, that is, a mountain pass where a valley 
crosses a ridge. The terrain in such a region evidently has the shape of a 


*The German terminology (of which these are translations) is “ Sattelpunkt” or 
“ Staupunkt.” Alternatively the term “ Kreuzungspunkt ” is also used. 

+The convention of designating the order of a saddle point as being s — 1 when a zero 
having the same properties is referred to as being of the order s arises from the fact that the 
inverse function has a branch point where the given function has-a saddle point, whereas the 
reciprocal function (not to be confused with the inverse function) has a pole where the given 
function has a zero. Just as the order of a pole receives the same designation as that of the 
zero of the corresponding reciprocal function, so the order of a saddle point receives the same 
designation as that of the branch point of the inverse function. These matters are discussed 
in greater detail at the end of Art. 18 after the method of dealing with multivalued functions 
is presented. 
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saddle. This fact accounts for the appropriateness of the term saddle 
point. 

For s = 3 the terrain has the shape of a saddle which might be designed 
for a three-legged person, or one may say that it is a region where three 
ridges and three valleys meet in a common point. 

Using a hydrodynamic analogy, one may regard the curves for 
“ = constant as representing the direction of fluid flow, and the curves 
for v = constant as designating the orthogonal set of contours along 
which the gradient is zero. The fluid is streaming symmetrically toward 
and away from the point 2. At this point the fluid is stagnant, thus 


Fic. 10. Loci of constant real and imaginary parts in the vicinity of saddle points 
of various orders. 


suggesting the term point of stagnation as an alternative designation. 

The pictures show, moreover, that if the order of a zero is greater than 
unity (or that of a saddle point is greater than zero), the orthogonality 
of the contours for « = constant and.v = constant fails in the point 
Z = &o, since the respective curves there intersect at an angle of «/2s 
radians. This failure is, however, only apparent inasmuch as the contours 
actually do not pass through the point but are bent sharply at it. Fluid is 
deflected at the point zo instead of flowing through it. 

From the standpoint of conformal mapping there is also an apparent 
failure in the angular relationships at a zero order greater than one (or at a 
saddle point of nonzero order). If, in addition to the polar representation 
of (g — Zo) given by Eq. 178, one also writes 


WwW — ao = pe’? [182] 
in which ay) may or may not be zero, Eq. 177 shows that in the immediate 
vicinity of the point z = Z one. has 

p=a,7 [183] 
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and 


0 = Sd [184] 


Since (z — 29) may be regarded as a small path increment radiating 
from the point zo in the z-plane, and w may similarly be regarded as the 


\ Gi 


8=3 
oe a 


Fic. 11. Algebraic signs of real and imaginary parts in the vicinity of saddle 
points of various orders. 


corresponding small path increment radiating from the point dp in the 
w-plane, one observes, according to Eq. 184, that if the increment (z — 29) 
is rotated through an angle Ad, the corresponding increment w rotates, 
not through the same angle (as the conformality ordinarily requires), but 
through an angle s times as large. 

This apparent failure in the preservation of angular relationships is 
clarified by the recognition that the inverse function z = ¥(w) is multi- 
valued and that the point z = 2 is a branch point of the order s — 1 for 
this inverse function. The discussion of conformal mapping in Art. 2 
points out that such a pair of maps in the w- and z-planes is a graphical 
representation not only for the given function but also for the inverse of 
it. Consequently one must recognize that although the function given by 
Eq. 182 is single-valued, the multivalued character of the inverse function 
cannot be ignored. 
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Although the detailed properties of multivalued functions is left for 
discussion in Arts. 17 and 18, it is appropriate to point out here how one 
may see that the failure in the preservation of angular relationships is 
only an apparent one. If the point z in the z-plane is enclosed by a 
sufficiently small circle c, the corresponding locus in the w-plane is very 
nearly a small circle c’ surrounding the point wo (which corresponds to 
zo). As a point traverses an arc on the circle c, the corresponding point 
on c’ traverses an arc s times as great. In Fig. 12, a and 6 are two small 
radial line segments emanating from the circle c, and a’ and b’ are the 
corresponding segments in the w-plane. The angle @ between a’ and 0’ is s 


Fic. 12. Apparent failure in the preservation of angular relationships at a saddle 
point. 


times that between @ and b. If the radii of the small circles ¢ and c’ are 
allowed to become still smaller, the line segments appear to radiate from 
the points 29 and wo, and one is led to conclude that the preservation of 
angular relationships has failed because one’s attention is focused upon 
the angles ¢ and ¢ rather than upon the angles between the line segments 
and the circular arcs, which remain equal to 90 degrees. However, if one 
mentally visualizes the situation in the limit as though the small circles 
were still there, it becomes clear that what has happened is not a failure 
in the preservation of angular relationships but rather is the result of a 
peculiar and somewhat misleading behavior of the given function in the 
vicinity of the point z = 2. This view is borne out by the fact that the 
Cauchy-Riemann equations, guaranteeing the uniqueness of the deriva- 
tive, still hold in this point. 


15. THE EVALUATION OF CONTOUR INTEGRALS; CAUCHY’S RESIDUE 
THEOREM 


If the formula given by Eq. 166 for the coefficients of the Laurent 
expansion is written for the integer value m = —1, it reads 


gf (evar = abs (185) 
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The contour S is in the present discussion assumed to enclose a region 
within which the function f(z) has but one singularity at the point zo. 
This situation is indicated in Fig. 13. If the coefficient b_, can be deter- 
mined in some way (for example, as described subsequently in this 
article), Eq. 185 represents a means for evaluating the contour integral 
for the function f(z) extended around a 
given closed boundary S. 

According to the Cauchy integral law, as 
discussed in Art. 6, the value of this contour 
integral is zero if the given function f(z) is 
regular at all points enclosed by the con- 
tour. The present result substantiates this 
fact, for if f(z) is regular also in the point Zo, 
the Laurent expansion about this point has 
no principal part (it becomes identical with 
the Taylor pine and hence b_; = 0. BIG go> “abe shee o 

: function about S is de- 
The present result may be said to repre- termined by the value of the 
sent a completion of the Cauchy integral law residue of the pole at 2. 
in the sense that it yields the value of the 
contour integral whether the function is regular within the enclosed re- 
gion or not, and hence it contains the integral law as a special case. 

An interesting alternative way of obtaining this same result is to begin 
by assuming that the contour integral is given and that an evaluation of 
it is sought. Since the singularity at 29 is the only one within the region, 
the contour S may be replaced by a circle C about 29 according to reason- - 
ing similar to that used in Art. 7 in replacing the integral 72 by the 
integral 73. In other words, the path of integration may be deformed or 
contracted so long as no part of it is allowed to sweep over a singular 
point. Thus 


S$ fG) ds = $f fle) ds [186] 
Now f(z) may be replaced by its Laurent expansion which reads 
fe) = E bale — 20)" [187] 


* Because of the uniform convergence of this series, the integration may be 
carried out term by term, giving ; 


f f(z) dz = =. on f (z — 29)” dz [188] 


If the radius of the circle C about zo is denoted by p, then 
(2 — 2) = pe’ [189] 
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and 
dz = jpe’® do [190] 
Equation 188 becomes 
bed y, an 
§ f(e)dz= XL jbno*t f ert dg [191] 
Ss n=—% 
But 
an 2” forn = —1 
(nt) a 
J a . w= {9 forn # —1 [192] 
Hence 
ff §(@) ds = Injds [193] 


which agrees with Eq. 185. 

The coefficient b_; is called the residue of f(z) in the point 2 = zp. 
The value of a contour integral enclosing a singularity is, therefore, 
equal to 27 times the residue of the func- 
tion in this singularity. This result is re- 
ferred to as Cauchy’s residue theorem. 

When the contour S encloses more than 
one singularity, that is, if the function f(z) 
is singular at several points 2, 22,°° + 2% 
within the enclosed region, as indicated in 
Fig. 14, the contour S may be replaced 
Fic. 14. The integral of a by & separate contours each enclosing one 
function about S enclosing of the singularities, and the value of the 
several poles is determined by contour integral around S is seen to be 

the sum of the residues. : So qie's 

given by the sum of individual contour 
integrals around the k separate contours. It becomes clear that in 
this case 


z-plane 


f, f(z) dz = 2nf (ba + 04 +--+ +b) [194] 


in which b_,™, 61, - - - are the residues of f(z) in the points 21, 22, «- - 
respectively. 

The contraction of the contour S to the several separate contours 
about 21, :, --+ may be visualized through supposing S to be a rubber 
band which is shrunk in the manner indicated in Fig. 15. The contribu- 
tions coming to the net result from those portions of the shrunken contour 
which in the limit become superimposed and are traversed in opposite 
directions evidently cancel. 
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As a practical means for evaluating a given contour integral, of course, 
this method is useless unless some way is found for determining the 
residue, which is the coefficient of the first term in the principal part, 
Eq. 168, of the Laurent expansion. When several singularities are en- 
closed by the contour S, the residues of the function f(z) are usually 
found separately for each of the singu- 
larities. If it is possible to find the first 
term in the principal part of the Laurent 
expansion for the immediate vicinity sur- 
rounding each singularity, this objective 
is accomplished. 

The following method for evaluating 
the residue is useful in many cases. If 
the singularit in Zp is rdin: 
pole, eS Poa i aaa SY Fis. 18. The conte puponoterct 

pole to the integral is accounted 


for separately by shrinking the 
[195] contour. 


$0) = 75 
is regular in z) and may there be expanded in the Taylor series 
$(2) =$(20) +’ (0) - (2-20) +30" (Zo) - (@—-20)’+ +++ — [196] 


in which the primes denote differentiation with respect to 2. 

The detailed process now varies according to the order of the pole of 
f(z) in 2. If this pole is of the first order, ¢(g) = 0, but '(gq) is not 
zero. Division of the series 196 into unity (by the ordinary process of 
or division ) ee 


—1 
=f) =F @-wy-5 
1 (¢’’)? 1 of” 
This is the Laurent expansion of f(z) about the point zo. Hence the 
residue in this case is 
do —1 
64, = (F [198] 


dz},=2 


me oF eo) 


If the pole of f(z) in 29 is of the second order, $(%) = 0 and ¢’ (zo) = 0, 
oe ¢’’ (go) ¥ 0. Division of the series 196 into unity then reads 


att 


=f(@)= o” ar Zo)? = : eae (zg - Zo)" 


CHD bie a SG ae aa 
FG a Gd ee 8 


ro) 
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from. which the residue is seen to be 
[200] 


When the pole is of higher order, the expression for the residue becomes 
increasingly more complicated, but the method of evaluation remains 
the same. 

The detailed aspects of the process of evaluating residues may, of 
course, be varied in a great many ways, and the most expeditious course 
depends entirely upon the form of the specific function in hand. Addi- 
tional ways of approaching the problem may, in the course of the solution 
of specific examples, suggest still other variations. 

If f(z) has an sth order pole in z = 2, then 


¥(z) = (2 — 20)*f(2) [201] 
is regular in this point. Hence it possesses the Taylor expansion 
¥(2) = (20) + ¥’ (Zo) + &@ — 20) + 34" @o) + @ — 20)? +--+ [202] 
Since 
fe) = 28, | [203] 


substitution of the Taylor series 202 for ¥(z) yields the Laurent expansion 
of f(z) about zo. It is then clear that the residue of f(z) in 2 is given by 


tke Se OV 
reer (a3 ae pon 


Sometimes the function ¥(z) in Eq. 201 is more conveniently expressed 
as the product of two simpler functions: 


¥(z) = &(z) - n(z) [205] 
Suppose that 
E(2) = ag + a1(% — %) + a2(% — 20)? +--> [206] 
and 
n(z) = Bo + Bi(z — 2) + Ba(2 — 20)? +-- [207] 


are the Taylor expansions for these functions about z = zo. Since the 
residue of f(z) in Zo is the coefficient of the term containing (2 — Zo)" 
in the product é- n, it is seen that_ 


by = ao8s—1 + O1Bs-2 +°°+ + os_1B0 [208] 


In numerous practical problems the function f (2) in the integral 185 
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has a finite number of ordinary poles in the finite z-plane, all of which are 
enclosed by the boundary S. At the point infinity, the function may or 
may not be singular. If the complex plane is thought of as replaced by the 
surface of the complex sphere, it is possible to regard the contour S either 
as one enclosing all the singularities of f(z) which occur for finite z-values 
or as one which encloses none of these singularities but merely surrounds 
the point at infinity. In other words, the region which ordinarily is re- 
garded as being external to the contour S may alternatively be inter- 
preted as being the enclosed region. The latter is, of course, traversed in 
the opposite sense, but for the moment this fact is of secondary impor- 
tance. If'it happens that the function f(z) is regular at infinity, one is 
confronted with the peculiarity that the integral around a contour 
enclosing no singularities is nevertheless not equal to zero. The unique 
feature about this situation, however, is that the region in question 
contains the point at infinity. Hence one must conclude that the residue 
of a function in the point at infinity is not necessarily zero if the function 
is regular there. In fact, the residue of f(z) at infinity is equal to the 
negative sum of its residues in all its singularities which occur for finite 
z-values. 

A simple example may illustrate this point more specifically. Suppose 
f(z) = 1/z. This function has a simple pole at z = 0 (with the residue 
unity) and is regular everywhere else. At infinity the function has a 
simple zero. A circular path enclosing the origin may alternatively be 
regarded as a circular contour enclosing the point at infinity. If these 
paths are separately traversed in their counterclockwise directions, the 
values of the resulting integrals are +22) respectively. Notwithstanding 
the fact that f(z) is regular at infinity, it is seen that the contour integral 
enclosing this point has a nonzero value. 

Conversely, one cannot conclude that the integral has a nonzero value 
for a contour enclosing the point at infinity if the function there has a 

. simple pole. For example, if f(z) = 2, the contour integral evidently has 
the value zero. 

The residue of a function in the point at infinity cannot be evaluated 
by any of the processes which apply to finite points. One might suppose 
that such an evaluation could be accomplished through first introducing 
the change of variable indicated by the substitution ¢ = 1/z, which inter- 
changes the origin with the point at infinity, and then proceeding in the 
normal fashion. Again the above example for the function f(z) = 1/z 
shows that this method is obviously incorrect. 


16. THE PARTIAL FRACTION EXPANSION OF RATIONAL FUNCTIONS 


Tf f(z) is a rational function, then, as pointed out in Art. 13, it has a 
finite number of poles in the entire z-plane. This number, the pole at 


308 FUNCTIONS OF A COMPLEX VARIABLE [Ch. VI 


infinity being excluded if present, may be denoted by 1, and the z-values 
corresponding to the poles of f(z) by 21, Z2, - - : 2n- The principal parts of 


Laurent expansions for the function about the points 21, 22, - ++ 2, are 
denoted by fy, he, -- - hy respectively. The function 
(2) — hae) — hale) — --- — In(2) = (2) [209] 


must be an entire rational function, that is, a polynomial in z whose 
highest power equals the order of the pole of f(z) at infinity. This fact is 
recognized through observing, for example, that f(z) — 4,(z) must be 
regular at the point z = z, because its Laurent expansion about z; has 
no principal part and hence is a Taylor expansion. However, the function 
f(z) — hy, (z) still has poles at the points ze, 23, - - : Zn. Next, the function 
f(z) — (2) — he(z) is seen to be regular at the points 2; and ze, and 
therefore has only the poles at 23, 24, - ++ n, and so forth. 

Transposing the principal parts in Eq. 209 to the right-hand side, one 
obtains the representation 


f(z) = In(z) + ha(z) +--+ + An(z) + gle) [210] 


in which each term places one of the poles of f(z) in evidence. It is an 
explicit representation of the function in the form of a linear superposition 
of the individual contributions of its singularities. This representation 
for the function f(z) is known as its partial fraction expansion. 

More specifically, if 


fia) = 202) _ coh ons + ast? fo + ans 
q(z) Bo + Biz + Boz? +.-.+ 6,2" 
and it is assumed that all the roots of g(z) are distinct, all the poles of f(z) 


[211] 


are simple. These roots may be denoted by 2), 22, -- - 2,. The principal 
parts of f(z) in its poles then have the form 
b_|” 
hy(g) = [212] 
in. which 6_,, b_,@, - -- 6.1™ are the corresponding residues. 


Applying Eq. 198 of the previous article, and noting that according 
to Eq. 195 $(z) is in this example ¢ = ¢/, one finds 


» — se) 
are dg [213] 


dz je=xu 


The derivative of g(z) for z = 2, may be further evaluated through noting 
that the factored form of this polynomial reads 


q(z) = Bn(z — 21)(2 — 2) +--+ (@ — an) [214] 
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and hence that 


d 
(2) ee: Bn (Sv — 21) (Zr — 22) +++ (Sr — Sr1)(Z — S41) +++ (Ze — Sn) 
[215] 
which may alternatively be written 


(Bd bare 2 


The way in which Eq. 215 is arrived at may readily be seen through 
first regarding g(z) in Eq. 214 as being in the form 


q(z) = (2 — a) - g*(z) [217] 


in which g*(z) is the right-hand side of Eq. 214 with the factor (z — z,) 
missing. Now applying the rule for the derivative of a product to q(z) as 
represented in Eq. 217, one finds that 


d dq* 
a =G@-%)F +9): [218] 
and hence that 
d 
=e = aCe) [219] 


which agrees ‘with Eqs. 215 and 216. 
In view of these considerations it becomes clear that the residues as 
given by Eq. 213 may ers be written in the form 


1? = =[( — By) ‘fl. =Zy [220] 


These results are unchanged if the polynomial g(z) has a zero root, that 
is, if By) = 0. They are restricted to the case of simple poles, of course, 
since the roots of g(z) are, in the above analysis, assumed to be distinct. 
When multiple roots occur, methods similar to those discussed in the 
previous article for evaluating residues at multiple-order poles must be 
used to determine the principal parts /,(z) in Eq. 210. 

If all the poles of f(z) are assumed to be simple, the results stated by 
Eqs. 212 and 220 determine the partial fraction expansion 210 except for 
the rational integral function g(z). This function is found from the form 
of f(z) given by Eq. 211 in the following way. If m Su, then f(z) is 
regular at infinity, and the entire rational function g(z) reduces to a 
constant. More specifically, if m = m, then 


g(z) = 7a [224] 


whereas if m < n, then g(z) is identically zero. 
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On the other hand if m > ~ and m — n = s, then f(z) has an sth order 
pole at infinity. The function g(z) is found through dividing g(z) into 
p(z) by long division so as to get 


* 
PS) Saha cl hain eyes [222] 
q(2) q(z) 
- in which the remainder polynomial p*(z) has the degree n — 1. Then 
g(z) = ¥o + vie +++ + 68" [223] 


For m = n this process yields g(z) = vo = on/Bn, whereas for m <n 
itis clear that g(z) = 0 as stated above. 
These considerations amount to putting f(z) into the form 
es! P*(z) _ 

f(@) = g@) + i= g(z) + f*(2) [224] 
in which f*(z) has a simple zero at infinity but contains the same poles 
as f(z) for the rest of.the z-plane. In these poles f*(z) has the same prin- 
cipal parts /,(z) as the function f(z), and these principal parts are found 
according to Eqs. 212, 213, and 220 by use of either p(z) and f(z) or 
p*(z) and f*(z) in these expressions, whichever appear to be more ex- 
pedient. 


17. MULTIVALUED FUNCTIONS; BRANCH POINTS AND RIEMANN 
SURFACES 


A multivalued function with which the reader undoubtedly has some 
acquaintance is the logarithm.* This function is defined by the integral 


_ a 
nz= fo [225] 


in which the path of integration in the ¢-plane extends from the point 
¢ = 1 to the point ¢ = z, but otherwise remains arbitrary. The integrand 
is the function 


A = 5 [226] 


which is regular everywhere except at the origin (¢ = 0) where it has a 
simple pole with the residue unity. . 

, Hence the value of the integral is not affected by a deformation of the 
path of integration so long as no portion of this path is allowed to sweep 


*Unless otherwise specified it is understood that the natural logarithm is implied. This is 
denoted by In z to distinguish it from the Briggs logarithm, which is written log z. 
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across the origin. With reference to Fig. 16, one has 
a (a_ ga [227] 
Py ¢ iP, £ 1 ¢ 


in which the subscript 1 on the closed contour integral indicates that the 
origin is encircled once in the counterclockwise direction. From Art. 15 
it is seen that therefore 


{2 7 2 = Qn} [228] 


Fic. 16. The integral on closed Fic. 17. The integral on closed 
path P,-P, yields 2xj since the path P,~P, again yields 277. Path - 
branch point at the origin is en- is different from that in Fig. 16 
circled once in counterclockwise but sense around branch point is 
sense. the same. 


The same result is true for the two alternative paths shown in Fig. 17. 
In general, if the path P, encircles the origin in the counterclockwise 
direction ” more times than the path P, does, 


dg 2 a a 
ao S. oo anit, [229] 


Hence if in conjunction with the definition 225 no statement is made 
relative to the path of integration, the value of the logarithm is deter- 
mined only within an arbitrary integer number of 22j’s. Its multivalued- 
ness is thus apparent. , 

This result is also readily obtained from the more familiar definition of 
the logarithm, according to which 


g=en [230] 
If Inz is here replaced by In z + 2rnj, the value of the exponential 


differs only by the factor e?*"%, which has the value unity. The view 
stated in the preceding paragraph, however, gives some pictorial signifi- 
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cance to this multivaluedness, whereas the more elementary reasoning 
precludes the possibility of such an interpretation. 

For further elucidation, the following 
more detailed representation is helpful. The 
complex number z is written in the polar 
form 


z= rei? [231] 


As shown in Fig. 18, the path of integration 
is assumed to consist of the portion Z; coin- 
cident with the real axis from 1 to 7, followed 
by the portion Lz, which is an arc drawn 
from the point 7 on the real axis to z. 

The variable of UiteaRe non is also written 
in the polar form 


Fic. 18. Integral of 1/z along 


L, contributes to real part of t = per? [232] 
logarithm; that along Zz con- whence 
tributes to imaginary part. ; z 
| dt = ay 4 2 ag (233] 
op” 00 
or : 
dt = e7? dp + jpe?* do = (dp + jp dé)e?® [234] 
Then 
OF [235] 
c Pp 


The integral 225, separated into two parts corresponding respectively to 
the portions L, and Lz of the resultant path of integration, reads 


_ (45 [" 
Inz= f'" +i f da [236] 
a Lo 
Now 
dp 
—=Inr [237] 
1 p 


is the logarithm of the magnitude of z, whereas . 
f di =¢ [238] 


Ing =Inr+j¢ [239] 


The multivaluedness of the logarithm is seen to result from the addition 
or subtraction of an integer number of complete revolutions to the path 


is the angle of z. Hence 
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Ly. This operation simply adds positive or negative integer multiples 
of 2x to the upper limit of the integral 238 and hence to ¢ appearing in 
Eq. 239. Written in more complete form, this relation, therefore, reads 


Inz=Inr+j¢ +720 [240] 


in which is any positive or negative integer. 
For the detailed discussion of the multivalued character of the loga- 
rithm it is expedient to define the value of Eq. 240 for x = 0, namely, 


. Inz =Inr+j¢ . [241] 
as the principal value of In 2, and write 
Ing=Lnzg+j2xen © [242] 


Ordinarily in speaking of the logarithm one has in mind the principal 
value only. 

Now that the multivalued character of the logarithm and the reasons 
for it are established, attention may be given to the question of how the 
ambiguity can be taken care of when this function enters into some 
problem, for example, in the consideration of the mapping of w = In z 
in the w- and z-planes. To a given point in the z-plane there correspond 
an infinite number of points in the w-plane, all of which have the same 
real part whereas their imaginary parts differ by multiples of 27. 

Such a set of points in the w-plane is presented in Fig. 19, which shows 
that the entire z-plane is mapped in any one of the oppositely cross- 
hatched strips, 27 units wide. Horizontal lines in the w-plane correspond 
to radial lines (@ = constant) in the z-plane; vertical lines in the w-plane 
correspond to concentric circles about the origin (r = constant) in the 
z-plane. The system of concentric circles and radial lines in the z-plane 
is, therefore, transformed into a rectangular grid in the w-plane. 

The fact that the locus joining a set of points in the w-plane corre- 
sponding to the same z-value (like the vertical dotted line in Fig. 19) 

. becomes a circular locus in the z-plane which winds around and around 
the origin suggests that the multivaluedness may be eliminated artificially 
through conceiving the z-plane in the form of a winding surface comprising 
an infinite number of superimposed leaves which have a common central 
point at the origin and simulate a winding staircase of infinite width in 
which the steps are replaced by a smooth continuous ramp. As the slope 
of this winding surface is made smaller and smaller, the spacings between 
‘the leaves (or successive elevations of the ramp) ultimately become 
negligibly small. 

The one original z-value representing an infinite number of w-values 
differing by j72n is now separated into a uniquely corresponding infinite 
number of z-values which lie directly above and below each other on the 
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various stages of the ramp or levels of the continuous winding surface. 
Each level, or leaf, of the surface carries one of the separate z-values 
which now corresponds uniquely to one of the w-values. To go from the 
z-value representing a given w-value to that representing the value 
w + j27, one must follow the ramp once around the origin in the counter- 
clockwise or clockwise direction respectively. The winding sense of the 
ramp is such that the increment in w is +j2z for one revolution in the 
counterclockwise direction. 
ly This hypothetical surface which 
w 


| Yi YT, y Y Yy WY YY thus effectively renders the func- 
Y, MU ip x tion single-valued is a Riemann 

Ye WCW SS | 
" \ NY: point about which the surface 


surface. The origin or common 
D277 
Wj 6 Y YYUAY winds is called the winding point or 
Ky Y/// Wy iY, 2 branch point of the Riemann sur- 
q ( é & < face. It evidently is a singularity, 
Nar . for the logarithm function does not 
SOS 
2 
In r—> 


have a finite value there.* With 

the exception of this point and the 
Fic. 19. Sections of the w-plane each sae sas inatys ue renCuOn ale 
of which corresponds to a leaf of the 18 regular in the entire Riemann 
Riemann surface of In z in the z-plane. surface, and its integral around 

any closed contour is zero because 
it is impossible to wind about the origin in the same sense a nonzero 
number of times and return to a given starting point. 

The logarithm is an illustration of a multivalued transcendental 
function. The Riemann surface for such a function has an infinite number 
of leaves because the multivaluedness is infinite. The inverse trigonomet- 
ric functions tan! z, sin™! z, cos! z, etc., are other examples. 

A class of multivalued functions having a finite degree of multi-: 
valuedness, and hence possessing Riemann surfaces with a finite number 
of leaves, is found in the algebraic functions.t A simple example is the 
function 


w= Vz — 2% [243] 


In view of the preceding discussion it is convenient to write this expression 
in the form 
w=e 1/2 Lin (2—20) ++-jan [244] 

*It should not be inferred that branch points necessarily are singularities of this kind. It is 
possible (as in the case of many algebraic functions) for the function to have a finite and 
unique value at a branch point. 

fNot all algebraic functions are necessarily multivalued. The rational functions, for 
example, are single-valued algebraic functions. 
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For a given value of z, this function has two values, one for n = 0 and 
the other for 7 = 1. For x = 2 the same value again obtains as for n = 0, 
and for » = 3 the (m = 1)-value is repeated, etc. The Riemann surface 
for this function, therefore, has but two leaves. The branch point is 
located at 2 = 2p. 

The structure of the Riemann surface about z = 2 is similar to that 
of the logarithm function about its branch point, but the complete 
surface must now be so constructed that the same leaf is regained after 
two complete revolutions about the branch point. The first step toward 
clarifying this picture involves a recognition of the fact that the point at 
infinity has the same character as the point zo. This circumstance is 
somewhat easier to see if the variable is changed by the transformation 


1 
3— Zo 


[245] 


Then Eqs. 243 and 244 become 
w= (g/)TH2 = e HR inet [246] 


in which the point z’ = 0 corresponds to z = ~. 

Since the point at infinity must be included in the present visualization 
process, it may be a little easier if the z-plane is replaced by its associated 
complex sphere. This sphere is imagined to have a double surface. Both 
surfaces are slit from the point z = 49 to the point at infinity along a path 
which is arbitrary except that it shall have no crossover points. Through- 
out the entire length of this cut, the top leaves of the double surface are 
now imagined to be joined to the bottom leaves on the opposite sides of 
the cut (a physical impossibility, of course, but not beyond the powers 
of a good imagination) so that crossing the cut in either direction effects 
a transfer from the upper to the lower leaf of the Riemann surface or 
vice versa. This continuous duplex ramp is called a branch cut. 

The resulting duplex surface evidently has the properties required for 
the unique mapping of the function given by Eq. 243. The two z-values 
corresponding to the two w-values that differ by a factor e’™ lie above 
one another on the two leaves of the Riemann surface, and to go from 
one of these g-values to the other it is necessary to cross the branch cut 
and pass completely around either the branch point at ¢ = 4 or that at 
z = o, The same z-value may be regained by passage around a branch 
point an even number of times. 

The function 


w= V(e — %)(% — %) +++ (8 — Be) [247] 


is also double-valued, and its Riemann surface again has two leaves. 
The branch points are 21, 22, ° * * 2¢. For very large values of z the function 
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behaves like z*/?, and hence it is clear that the point at infinity is also a 
branch point if & is odd. 

The branch cuts of the Riemann surface are made along paths joining 
the points z, and zg, 23 and 24, etc. These paths cannot have any crossover 
points. In particular, for k = 2 the branch points are z = 2, and z = 2, 
which are joined by a branch cut in a manner exactly analogous to that 
discussed for the points z = 29 and z = © in. the preceding example. 
In fact, the two-leaved Riemann surface for the function 247 with 
k = 2 is in all respects similar to that for the function 243 except that the 
second branch point lies at a finite z-value. A closed path which encircles 
both branch points 2; and z, remains on the same leaf of the Riemann 
surface; that is, a given point z on 
this closed path is regained after a 
single traversal. 

This situation may be clarified by 
reference to Fig. 20. A traversal en- 
closing both points 2, and gg which 
begins and ends at P must be equiva- 
lent to a traversal around the closed 
path L, followed by a traversal around 
the closed path Le, because the differ- 
: . ence amounts only to the traversal 
Fic. 20. A point traversing Liand from P to Q and back to P, around the 
L, in succession must return to the . ; 
same leaf of the Riemann surface, Saded area which contains no branch 

point. Inasmuch as a separate traver- 
sal around LZ, or around Lz effects a transition from one leaf of the 
Riemann surface to the other, it is clear that a given point traversing 
these two circuits in succession must return to the same leaf of the 
Riemann surface. 

Analogous reasoning shows that the cases k = 3 and k = 4 have 
similar Riemann surfaces. For k = 4 the finite point z4 replaces the 
branch point which for k = 3 occurs at infinity. The surface has two 
branch cuts, and a closed circuit which surrounds any two branch points 
returns to a given point after one traversal. The extension of this reason- 
ing to the interpretation of the two-leaved Riemann surfaces for larger 
values of & is straightforward. 

An extension of the same reasoning likewise leads to the required 
structure of the Riemann surface for the function 


w= Ve— 2% [248] 


or 
Lin (2-20) +9 en 
em = m 


[249] 


This function is m-valued, has a branch point of the (m — 1)th order 


w= 
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at the point z = zp, and possesses a Riemann surface having m leaves. 
The point at infinity again has the same character as the point z = 29, 
and these two points are joined by a slightly more complicated branch 
cut. At this branch cut the top leaf of the Riemann surface is joined to 
the one located below it on the opposite side of the cut, whereas the 
second leaf on the original side is joined to the third leaf from the top on 
the opposite side, etc. Finally, the bottom-most leaf on the original side 
of the cut is joined to the top leaf on the opposite side. The result is that 
a path must encircle either branch point m times before a given point 
on one of the leaves can be regained. 


It may be useful to observe that the function w = <z has a Riemann 
surface entirely similar in its structure to that of the function w = Inz 
except that the latter has an infinite number of leaves. The branch cut 
in the case of the function w = Vz is a necessary concept only because 
some mechanism must be imagined whereby the bottom leaf rejoins the 
top leaf of the surface. Actually leaving this mechanism entirely to the 
. imagination is more effective than trying to formulate some sort of 
piecing and pasting process between the leaves which must afterward 
be apologized for because of the mechanical impossibility of carrying out 
such a scheme in a physical model. 

A mathematician prefers not to be annoyed with the physical difficulties 
involved in the visualization of a branch cut, the more so since it implies 
the existence of a definite path along which the passage from one leaf to 
another takes place. According to the true conception of a Riemann 
surface, such passage is not to be regarded as localized in a branch cut. 
Rather, one is to regard the invention of a branch cut as made necessary 
only by reason of the inadequacy of one’s habitual conception of space to 
comprehend the mechanism of the Riemann surface. 

In the function w = Inz this difficulty does not appear, but it is 
replaced by the equally difficult conception of an infinite number of 
leaves in the Riemann surface. In terms of the complex sphere associated 
with the z-plane, the south pole (origin) and the north pole (infinity) are 
branch points of infinite order. The Riemann surface is a continuous 
succession of spherical shells winding round and round a polar axis, 
somewhat on the order of a snail’s shell only that the winding pitch is 
zero and the number of revolutions infinite. The surface for the function 
w = 2 has the same structure but comprises only m revolutions, after 
which identity with the original leaf is again established. 


18. ALGEBRAIC FUNCTIONS; MORE ABOUT THE CLASSIFICATION OF 
FUNCTIONS 


From the discussion just given it is clear that if the function w = f(z) 
has a branch point of the (m — 1)th order in z = 4p, then, by use of the 
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substitution 
Z— 2 = re’* [250] 
one may study the function 
f(@) = f(@ + re7*) (251] 
for an appropriate value of r on the various leaves of its Riemann surface 
through letting ¢ vary continuously from some initial value ¢o to a final 
value ¢9 + 2m. A further increase in ¢ will merely yield the same values 
over again. 
The m leaves of the Riemann surface for this vicinity of the point 
Zq are encountered in cyclic order in the intervals 


oo +2» — 1)e <4 < bo + Qe [252] 


with vy = 1,2,-+-m. 
The m-values of the multivalued function f(z) may correspondingly 
be denoted by 


WwW, = to + pelle +20—Dal) [253] 
' with ¢9 < @ < ¢o + 2z. 
The quantities w1, we, --- Wm, which represent the function on the 


various leaves of its Riemann surface, are called the branches of the 
function f(z). They evidently form a cyclic group, since, for the interval 
bo + 24 <¢ < do +42, for example, w; replaces we, we replaces ws, 
and so forth, and w,, replaces w. 

In terms of these branches of the function w = f(z), an important 
property of the branch point of finite order may now be stated, namely, 
that the same limiting value of the function results for the limit z— z9 
regardless of which one of the branches w, we, - - - etc., is chosen in the 
process of evaluating this limit. In other words, the value of f(z) in the 
branch point may be approached from any one of the m leaves of the 
Riemann surface. Since the branch point thus yields a unique value for 
the function, it may be included as a point of this surface. 

It is possible for a multivalued function to have this character in the 
vicinity of a point z = Z and there possess an infinite or finite value, 
either of which is definitely determinable. A singularity of this sort has a 
more general character than that of either a branch point or an ordinary 
pole, since it embraces both these as special cases. It is called an algebraic 
singularity inasmuch as it represents the only kind of singularity found 
in that class of functions known as the algebraic functions. 

In order to study the behavior of an m-valued function in the vicinity 
of an algebraic singularity at the point z = %, one may make the change 
of variable indicated by 


t= (z — 29)!/™ [254] 
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Then 
w= fe) =¥@) [255] 


becomes a single-valued function of the complex variable #, since the 
latter has taken over the m-valued character in terms of the original 
variable z. If the function ¥(#) possesses an th order pole at the point 
t = 0 (corresponding to z = Zo) then, for the vicinity of this point, it 
may be represented by a Laurent expansion whose descending part 
contains terms. Hence it becomes clear that the function f(z) admits 
the following expansion in the vicinity of an algebraic singularity: 


fe) = E Cale — 50y"™ [256] 


This singularity is an ordinary pole of the mth order for the function f(z) 
if m = 1. It represents an ordinary branch point of the (m — 1)th order 
ifn = 0. 

In general, that is for m > 1 and n > O, the function f(z) is said to 
possess a singularity at the point z = z which is simultaneously a pole 
of the order n and a branch point of the order m — 1. For example, the 
function f(z) = ’z has a branch point of order 1 at z = 0, whereas at 
z = o it has simultaneously a simple pole and a branch point of order 1. 
The same statement, with an interchange of reference to the points z = 0 
and z = ©, applies to the function f (2) = 1/ V3. Tf nis infinite, the func- 
tion has an essential singularity at the branch point of order m — 1. On 
the other hand, if m is infinite, the function is said to have a logarithmic 
branch point (it is then no longer an algebraic function, for the latter can 
have branch points of finite order only). 

Algebraic functions are defined as functions possessing only a finite 
number of algebraic singularities. If the function is m-valued, it is not 
necessary that all or that even one of its singularities also be a branch 
point of the order (m — 1), but a sufficient number of these singularities 
must be branch points of such order and distribution as will insure that 
the m leaves of the associated Riemann surface form a connected system. 
For any nonsingular z-value the function f (s) possesses the m-values 
indicated by 

f(@) = wi, Wa, > ++ Wm [257] 


which represent the function on the correspondingly numbered leaves of 
its Riemann surface. In terms of these branches of the function f(z). one 
may form the system of symmetrical functions 


$i(Z) = w+ we ties + Wm 


Yo(Z) = WrWe + Wyw3, +++ + Wn-1Wm 
Se eden alee wr this Gewese iad date ine eaten [258] 


f 
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In view of the fact that the branches w,, we, ++ - Wm form a cyclic group, 
it follows that all the symmetrical functions ¥, Yo, --- Wm are single- 
valued. That is, as the point z is allowed to move along any path on the ' 
Riemann surface, avoiding only the singularities of f(z), the functions 


W 1, We, *** Wm can merely interchange their identities, and since the 
functions ¥1, ¥2, °** Wm are given by symmetrical combinations of the 
elements w1, Wo, ‘** Wm, they are not affected by such interchanges. 


Furthermore, they can have only algebraic singularities because this is 
the only kind that the branches w;, we, - -- Wm possess, and the y’s are 
formed from the w’s by the processes of addition and multiplication alone. 
Algebraic singularities in the case of single-valued functions, however, 
must be ordinary poles. Hence the functions y1(z), ¥2(z), + + + ¥m(z) must 
be rational. 

Now it is to be recalled from the theory of algebraic equations that the 
symmetrical functions defined by Eq. 258 satisfy the equation 


(w — wi)(w — We) +++ (W — Wm) 
= yw" — v1 (z) * wrt i (—1)"Yn (z) = 0° [259] 


It is also recognized that the functions 258 have a common denominator 
equal to the product of the denominators of the branches w, we, --- Wm, 
and that this common denominator must be a rational entire function 
of z, from the last of Eqs. 258, or alternately because the branches w, 
We, ‘+ Wm individually are single-valued on their respective leaves of the 
Riemann surface and there possess singularities which, therefore, can be 
none other than ordinary poles. Multiplying Eq. 259 by this rational 
entire function, which is a finite polynomial f(z), one finds 


F(z,w) = po(z)-w™ + pi(z)-w™! +++ + dm(z) =0 [260] 


in which fo, f1, --- Pm are finite polynomials in z. 

The algebraic functions w = f(z) are thus seen to be defined as the roots 
of an algebraic equation whose coefficients are ordinary polynomials in 2. 
More precisely, the roots are the branches which: collectively define a 
single algebraic function on the various leaves of its Riemann surface. 
The function F(z,w) may never be reducible to the product of two or more 
factors having the same form as F(z,w), since the vanishing of any factor 
alone would then satisfy Eq. 260, and hence several independent functions 
rather than a single one would be defined by this equation. 

The branch points occur for those values of z for which Eq. 260 has 
coincident w-roots. The coincidence of roots requires that the discriminant 
be zero; and since this discriminant is a rational function of the poly- 
nomials ~o, £1, : +: Pm and hence a rational function of z, it cannot vanish 
identically, but can do so only for a finite number of z-values. Hence 
the number of branch points of the algebraic function defined by Eq. 260 
is finite. 
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The engineering student will find it helpful to visualize the Riemann 
surface as a set of m parallel metal sheets and the branch points as spot 
welds which join two or more of the sheets at isolated points. The dis- 
tribution of these spot welds and the number of sheets held together by 
each must evidently be such that the m sheets are connected; otherwise 
instead of a single m-valued function, several functions of lesser order 
than m are defined. 

The classification of functions in terms of the nature and distribution 

_ of their singularities, which is partially discussed in Art. 13, may now be 
viewed in a more thorough fashion. All analytic functions may be divided 
into two main classes, which are the algebraic and the transcendental. 
In other words, any function which is not algebraic belongs to the 
transcendental group, comprising single- or multivalued functions having 
essential singularities. Algebraic functions, on the other hand, may be 
further subdivided into single-valued and multivalued functions. ‘The 
single-valued ones are identified with the rational functions, which 
include some of the entire functions, namely, the finite polynomials. 
The meromorphic functions, which are ‘single-valued and may have any 
kind of singularity at infinity but must have only poles in the finite 
z-plane, include some of the functions in the transcendental group and 
all the functions in the rational group. The entire functions may likewise 
be regarded as a subclass in the transcendental group, but in addition 
they lay claim to some of the functions in the rational group. The follow- 
ing block diagram is intended to unify these remarks. 


TRANSCENDENTAL 
single- or multivalued, 
essential singularities, 
branch points, poles, and 
anything else, unlimited 
or limited in number 


ALGEBRAIC 
finite number of 
algebraic singularities 


ENTIRE MERO- 
MORPHIC 


unlimited _ or 


SINGLE-VALUED 
or 


MULTIVALUED 
finite number of 


Baath acini ead RATIONAL | limited number 
ree ‘Lyn re finite number of poles in finite 
possibly poles of poles z-plane, anything 


at infinity 


It is now possible to discuss somewhat more adequately the inversion 
of functions (which is introduced in Art. 3) with particular reference to 
the vicinity of saddle points described in Art. 14. Suppose a given single 
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valued function w = f(z) has, at a point z = 20, vanishing first and higher 
derivatives up to but not including the wth. For the vicinity of this point 
it then possesses the Taylor expansion 


Ww = Wo + an(Z ~ 20)” + Gngi(s— so)" +--+ = [261] 
with a, ¥ 0. Writing this equation in the form 
DAY gay) +H Gate. [262] 


nr 


extracting the mth root, and introducing for abbreviation the variable 
_ 1/n 
= (2#—*) [263] 


one may put the series 262 into the form 


jn 
7 = @ — %) {i + S24 @ = m9) + SY Gayo} [264] 


The bracketed expression, inclusive of the exponent 1/n, may for the 
vicinity of the point z = 29 (which is not a branch point) be expanded 
in a Taylor series having the form 


1+ by(z — 20) + bols — zo)? +e [265] 


This Taylor series represents the bracket function in Eq. 264 on only 
one of the leaves of its Riemann surface. The corresponding representa- 
tions on the remaining » — 1 leaves of this surface, however, differ from 
265 only by the factor 


ef@r7™ fory = 1,2,---n—1 [266] 
Substituting 265 for the bracket expression in Eq. 264 yields 
+ = (g — 29) + by(z — 20)? + bale — 20)? + --- [267] 


which, on the particular leaf of the Riemann surface in question, is a 
unique representation for 7(z) in the vicinity of the point z = 29. Hence, 
according to the discussion in Art. 3, it possesses an inverse function 
whose Taylor series for the vicinity of + = 0 (which is a regular point) 
has the form 


(2 — 20) = Bit + Bor? + Bsr? ++ [268] 


The coefficients in this series for the inverse function may, for example, 
be found through first substituting the series for (2 — 20) into Eq. 267 
and, after arrangement in ascending powers of 7, obtaining 


7 = Bit + (b181? + B2)r” + (2618182 + 28, + Bs) +++ [269] 
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whence, equating coefficients of like powers of +, one has 


Bi om 1, 
Be = —by, [270] 
B3 = 2b," — be 


ee er} 


Substitution of the expression 263 for r into Eq. 268 then gives 


ee 1l/n ou 2/n = 3/n 
ee oo ee ee 


which is a representation for the inverse function z = ¢(w) for the vicinity 
of the point w = wo, corresponding to the point z = Z. Equation 271 
may be written 


2 = $(wWo) + c1(w — wo)!” + co(w — wo)?!” +--+ : [272] 


This, however, is recognized (according to the discussion leading to Eq. 
256) as the expansion of an m-valued function in the vicinity of a branch 
point of the (7 — 1)th order at the point w = wo. Hence it is established 
that if a- given function w = f(z) has a saddle point of the (m — 1)th 
order at a point z = Zo, the inverse function z = ¢(w) has a branch point 
of the (w — 1)th order in the corresponding point w = wo. If the point 
% = 2 is a zero of the mth order for the function w = f(z), the above 
analysis remains unaltered except that wp becomes zero. The inverse 
function z = ¢(w) then has a branch point of the (w — 1)th order at the 
origin in the w-plane instead of at the point w = wy. 


19. A THEOREM REGARDING THE NUMBER OF ZEROS AND POLES 
WITHIN A GIVEN REGION; THE FUNDAMENTAL LAW OF ALGEBRA 


A given function w = f(z) is assumed to have a zero of the order a 
in the point z = Z9. The Taylor expansion about this point then reads 


f(@) = Ga(Z — 20)% + dapi(% — 2 )et + +> [273] 
and the derivative of f(z) is given by 
f' (2) = ada(z — 29)°* + (a + 1)dagi(@ — Z)*+°++ [274] 
Dividing the series 273 into the series 274 by long division gives 


as a(Z — Zo)? + eo + c1(% — 20) + ca(s — 2)? +--+ [275] 


This function, therefore, is seen to have a pole of the first order in the 
point z = Zo with the residue a. 
Alternatively, let it be supposed that the given function f(z) has a pole 
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in z* of the order 8. Then its Laurent expansion about this point is given 
by 


f(a) = bgs— BY + beue— ey +e. [276] 
and the derivative reads | 
f' (2) = —Bb_p(z — 2*)F — (6 — 1)b_eui(g — yO +--+ [277] 
By long division it is then found that 


ae) = —B(¢— +d) +d(e— 2) + do(z — o)? +--+ [278] 


from which this function is seen to have a simple pole in the point zs = 2* 
with the residue —. 

According to the residue theorem it follows from these considerations 
that 


f@) 
c f(z) 
where the path C is assumed to enclose no other zeros or poles of the 


function f(z) except those at the points zp and 2*. 
In general it is seen that if the path C encloses zeros of f(z) having 


dz = 2xj(a — 6) [279] 


the orders a1, a2, + - + a@%, and poles having the orders 61, Be, --- Bs, and if 
a tagt-:++a,=N [280] 
and a 
By + Bo +-++ +8, =P [281] - 
then 
/ 
os pre dz=N—P [282] 


The integer NV is equal to the total number of enclosed zeros of the func- 
tion f(z), each one being counted as often as its order requires; the integer - 
P is equal to the total number of enclosed poles of the function f(z), 
each one likewise being counted as often as its order requires. The result 
given by Eq. 282 states that the contour integral formed for the ratio 
t'/f, when multiplied by 1/27, is equal to the total number of enclosed 
zeros of the function f(z) diminished by the total number of enclosed 
poles, each of these being counted as often as its order requires. 

An application of this result to a special case is of particular interest. 
Here the function f(z) is assumed to be the finite polynomial 


$@) = Bt dys! toe aye + ay [283] 
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Then 


fe) m3 + (mn ~ Nay? + hy 
f@) 2" + dy_12""* + +++ 4+ a2 + a 


Dividing the denominator into the numerator by long division, one obtains 
a series of the form 

f@) _m by | bs 

ayers ne 2 + 2 [285] 
which converges only for z-values lying outside a circle enclosing all the 
zeros of the function f(z). In the region outside this circle it represents the 
Laurent expansion for the function f’(z)/f(z), whence the integer 1 
is recognized to be equal to the sum of the residues of this function in 
its poles, all of which are located within the circle. Consequently, if C 
denotes this circular contour, then 


1 ¢f® _, 

aaj Jo f(z) 
Inasmuch as the function f(z), given by Eq. 283, has no poles inside the 
contour C, the conclusion follows that the highest power 1 of the poly- 


nomial 283 equals the total number of roots of the algebraic equation 
f(z) = 0. This is the familiar fundamental law of algebra. 


[284] 


[286] 


20. A METHOD FOR THE DETECTION OF ZEROS WITHIN A GIVEN 
REGION 


Another useful application of the result stated by Eq. 282 in Art. 19 
is the following. If a given function w = f(z) is known to have only zeros 
within a region enclosed by the contour C, then 


1 ¢f®© 
Qj Jc f(z) 
equals the number of enclosed zeros. According to the principles of con- 


‘ formal mapping, the contour C in the z-plane determines an equivalent 
contour in the w-plane. The integral 287 may correspondingly be replaced 


by 


dz=N [287] 


lcs [288] 


in which D is a closed contour in the w-plane corresponding to the path 
C in the z-plane. From the discussion of the logarithm function in Art. 
17, it is recognized that the integer N in Eq. 288 must equal the number 
of times that the contour D in the w-plane encircles the origin. If this 
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contour encircles the origin more than once — that is, if V > 1 —it is 
clear that the inverse of the function w = f(z), namely, z = ¢(w), is 
multivalued, and that its Riemann surface in the w-plane has such a 
structure that a path corresponding to C in the z-plane closes upon itself 
after N complete circuits around the origin in the w-plane. 

‘The contour C may be located anywhere in the z-plane. The conclusion 
to be drawn is that the number of zeros of f(z) enclosed by C (each being 
counted as often as its order requires) is given by the number of times 
that the corresponding closed contour D encircles the origin in the w-plane. 
If the closed contour D (which may, when plotted in the simple w-plane, 
be any closed curve with or without crossover points) does not encircle 
the origin in the w-plane, the given function f(z) cannot possess any zeros 
within the region enclosed by C. Hence if it is to be established whether 
a given function f(z) possesses zeros within a stated region, the question 
may be answered through mapping in the w-plane the locus corresponding 
to a boundary enclosing the stated region in the z-plane, and noting 
whether this closed locus in the w-plane does or does not encircle the 
origin. 

It is, of course, necessary to know that f(z) does not have poles 
within the region in question. The method is evidently also applicable 
to the determination of the presence of poles within a given region if 
the function is known to have no zeros there. In either case, obviously, 
neither zeros nor poles should lie upon the contour C. 

In applying the method to practical problems it is helpful to note the 
significance of several detailed characteristics which the contour D may 
exhibit. The contour C is assumed to enclose a simply connected region 
in the z-plane. If this region contains neither zeros nor poles, the cor- 
responding contour D in the w-plane, as stated above, does not enclose 
the origin; this characteristic of the contour D, however, may not always 
be immediately evident. The inverse function z = ¢(w), except in trivial 
cases, is multivalued, and hence the region enclosed by the contour D 
may have overlapping portions which actually, of course, lie on separate 
leaves of the associated Riemann surface. Thus, if the region enclosed 
by the contour C contains saddle points (see Art. 14), the contour D 
encloses the corresponding branch points, and hence makes several com- 
plete circuitations about portions of the region enclosed by it. One 
readily appreciates, therefore, that the contour D may be a rather 
tortuous path even though the contour C is a simple one. It is possible for 
the region enclosed by the contour D to surround the origin completely 
and yet not contain the origin. 

The closed contour C is ordinarily assumed to be traversed in the 
counterclockwise direction, and the region enclosed by it is taken to be 
that on the left of this contour. The corresponding direction of traversal 
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for the contour D may always be established through considering any 
two neighboring points on C and determining the corresponding neighbor- 
ing points on D. According to the principles of conformal mapping, the 
region enclosed by D lies to the left of this contour, but this observation 
alone is frequently insufficient to enable one to tell by inspection whether 
the enclosed region contains the origin, and if so, how many times the 
contour D encircles the origin. 

A method for the correct evaluation of this situation is the following. 
One imagines a radius vector extending from the origin of the w-plane 
to a variable point w on D corresponding to the variable point z on.C. 
As z traverses C and w traverses D, this radius vector changes in length 
and in its angular position. For a complete traversal of z around C, the 
net change in the angle of the radius vector must obviously equal an 
integer number of 27 radians (a change is positive if it corresponds to 
rotation in the counterclockwise direction). The value of this integer 
equals the number of times that the contour D encircles the origin. 

If the function w = f(z) is analytic within the region enclosed by C, 
this integer cannot be negative, but it can become negative if the region 
also contains poles. For example, suppose f(z) = 1/z and let C be any 
contour enclosing the origin in the z-plane. Then it is clear that the integer 
in question equals —1. One may say that the contour D in this case 
encloses the point at infinity in the w-plane. If C is the unit circle, in 
this simple example D is likewise a unit circle. The enclosed region in 
the z-plane is that within the unit circle; the enclosed region in the w-plane 
is that lying outside the unit circle. The requirement that the function 
f(z) be known to have no poles (or other singularities) within the region 
enclosed by C is again recognized as being necessary, since the presence 
of a pole cancels the effect of a zero as far as the net value of the integer 
is concerned. ‘ 


21. THE PRINCIPLE OF THE MAXIMUM MODULUS; ROUCHE’S THEOREM 
AND SCHWARZ’S LEMMA 


To continue the considerations of the previous article, if the function 
w = f(z) is analytic on C and within the region enclosed by it, and if on 
this contour |f(z)| < M (some real positive quantity), it follows that 
the contour D and its enclosed region must lie within (or be tangent to) 
a circle of radius M concentric with the origin of the w-plane. This fact 
is clear from the consideration that the region enclosed by D cannot 
extend beyond a boundary formed by those confluent segments of the 
contour D which are farthest (but still at a finite distance) from the origin; 
otherwise the point at infinity would be contained within the region, 
thus contradicting the assumption that f(z) is analytic within C. Since 
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every point within the region enclosed by C yields a point in the w-plane 
which lies within the region enclosed by D, it follows that lf(2)| SM 
for all points within C. Moreover, the equality sign in this relationship 
holds only if the region enclosed by D consists of a single point, in which 
case it holds identically, for then f(z) must reduce to the constant M. 

This result is known as the principle of the maximum modulus. It may 
readily be demonstrated analytically with the help of Cauchy’s integral 
formula, Eq. 80. The closed contour S (upon and within which f(z) is 
analytic) is identified with a circle of radius p about some finite point 2. 
Then 


g— 2 = pei? [289] 
f(s) =F(@ + pe¥?) [290] 
and 
a =jda [291] 
Hence one has 
1 ; 
fe) = 5- [fe + ef) a [292] 


from which the value of the function at the center of the circular region 
is seen to equal the arithmetic mean of its values on the boundary. 

Since the mean of a set of complex values must be less than or at most 
equal to the mean formed from the magnitudes of these values, it follows 
that the magnitude of f(z) must surely be less than or at most equal to 
the largest value which the magnitude of the function assumes on the 
circular boundary. That is, if M is the maximum value of | if (¢)| on the 
boundary, | f(z)| < M. . , 

This result may be generalized to the extent that the boundary need 
not be circular and the point z may be any internal point. For if the result 
were not also true in this more general case — that is, if the maximum of 
the absolute value of the function did not occur on the boundary but at 
an internal point — by applying the specialized result to an appropriately 
chosen small circle about this internal point, one would clearly encounter 
a contradiction. This line of thought also shows at once that the equality 
of | f(z)| and M can hold only if it holds identically, that is, if f(z) equals 
the constant M. 

If the function f(z) has no zeros (as well as no poles) upon or within 
the closed contour, by applying the same reasoning to the reciprocal 
function 1/f(z), one recognizes that both the minimum and the maximum 
values of |f(z)| on the boundary are minima and maxima for the enclosed 
region. 
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An alternative proof, which is collaterally interesting, begins with the 
result expressed by Eq. 287 of the previous article. The function f(z) is 
analytic upon C and within the region enclosed by this contour. A second 
function g(z), in addition to satisfying the same analyticity conditions, 
fulfills the relation , 


lg@)| < |F()| [293] 
on the boundary C. On this boundary one, therefore, has 
g(z) 
rane 1 
4@) < [294] 
and hence the closed contour for the function 
-1488 % 
w(e) = 1+ 55 [295] 


in the w-plane, corresponding to the contour C in the z-plane, clearly 
cannot enclose the point w = 0. According to the discussion in Art. 20, 
therefore, one has 


at 0 [296] 


Using Eq. 295, one now finds 


dw _f-d(ft+s)—(ft+g)-F _Uf+s) Ff 


= Pe NS es oe Le 297 
w 10 +8) jor f° 
Hence in view of Eq. 296, the result expressed by Eq. 287 gives 
d 
1 gdft+s)_ 1 GF _ [298] 


dado ft+g Ande f 
from which it may be concluded that, under the conditions stated above, 
the function f(z) + g(z) has the same number of zeros within the region 
enclosed by C as does the function f(z). 

In terms of this result (known as Rouché’s theorem), the principle of 
the maximum modulus is easily proved. Suppose the maximum of 
|f(z)| did not occur on the boundary C but at some internal point go. 
Then on the boundary one would have 


1 1 ; 
[Feol ~ T7@l [299] 


and hence one could conclude that the function [1/f(zo) — 1/f(z)] has 
the same number of zeros within the region enclosed by C as does the 
function 1/f(z). The latter has no zeros within this region because f(z) is 
analytic there, and the function [1/f(zo) — 1/f(z)] has at least one zero ‘ 
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within the region, namely the one for g = zg. The supposition is, there- 
fore, untenable. 

A more specialized result is obtained if one considers the function f(z) 
to be analytic within a circle of radius R about the origin and to equal 
zero at the origin. It then possesses the Maclaurin series 


f(z) = ayz + agz + age? +--- [300] 


from which it is clear that the function 
6) -2 [301] 


likewise is analytic within the circle of radius R. If on the circular bound- 
ary it is known that 
lf@| su [302] 


then it follows that there 
M 
b@lsz [303] 


and, according to the principle of the maximum modulus, that the 
magnitude of ¢(z) is less than (or at most equal to) M/R for all points 
within the circle. Hence one has the result that 


lf@)| s fs -M [304] 


for all points within the circle, and the equality can hold only if it holds 
identically, in which case f(z) reduces to a complex constant with the 
magnitude M/R, multiplied by z. 

This particular result is known as Schwarz’s lemma. It has numerous 
practical applications in problems involving conformal mapping.* 


22. SoME USEFUL CORRELATIONS WITH POTENTIAL THEORY; 
PoIssONn’s INTEGRALS AND HILBERT TRANSFORMS 


In Art. 5 it is pointed out that the real and imaginary parts ~ and » 
of a function of a complex variable may be interpreted physically as 
being two-dimensional potential functions. Any two functions of the 
variables « and y which satisfy the Cauchy-Riemann Eqs. 12 and 13 are 
said to be conjugate potential functions. Since Eqs. 12 and 13 become 
interchanged when wu is replaced by v and v by —4, it is clear that the 
identities of these two functions become interchanged when the algebraic 


*An application of this sort is discussed in Art. 27. 
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sign of one of them is reversed. In the present article the properties of 
these functions are investigated in more detail. 

The point of departure in these discussions is the Cauchy integral 
formula, Eq. 80. A given function f(z) is assumed to be regular within the 
region enclosed by the circle shown in Fig. 21 as well as on this boundary. 
If ¢ denotes any point on the circle and z is any internal point, 


f(@) = = P HO. dt [305] 


Fic. 21. Change of variable relevant to the derivation of Poisson’s integrals. 


If, in this integral, z is replaced by some point 2* external to the circle, 
the integrand is regular for all points enclosed by the contour C, and 
according to the residue theorem or Cauchy’s integral law, the value of 
the integral is zero. That is, 


gee fo) 
The point z* is now so chosen that 
Is? _ 
w= SHS [307] 


in which the bar indicates the conjugate value.* Using this relation, one 


finds 
1 1 | c E 
aS = - UF Se HH 1 ; 308 
g-s  ¢-* slt-s on [308] 

*The point 2* thus determined is called the image of the point z with respect to the circle. 


The two points z and 2* are geometrically related as shown in Fig. 21. This item is discussed 
in Art, 24, 
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Addition or subtraction of Eqs. 305 and 306 is then seen to yield 
1 g 
f(s) = +40) + 5 $| 


oa cee ay dg 
pie et} 54 [309] 


in which Eq. 305 is also used to obtain the particular relation 


ee dy ; 
£0) = oS f(s) : [310] 
| Since, according to the notation given in Fig. 21, 
z= reid [311] 
and 
¢ = Ref? [312] 
one has 
iv R? — ¢Rei(¥-®) 
= py = a [313] 
¢—z  Re’* — re? R* + 7° — 2rR cos (y — ¢) 
Hence 
PS ee os See cee SO). [314] 
t—2 ¢-2 R*%+?7? — 2rRos (yy — 6) 
and 
$f __2R’—2rR cos (Wd) 4 RP 
t—-z ¢—z R®+1r?—2rR cos W—4) 1 RP 2 cos (¥—¢) [315] 
Equation 312 is used to obtain 
aim [316] 


Substituting these results into Eq. 309, one obtains the following 
integral representations: 


_ at pat rR sin (¥—¢) | 
0) = 40) +5" geen popIOO H [317] 


and 


RY be R?-? 
f@) => f +7 — Roos 6) (Ry) dp [318] 


It is useful to observe that the quantity rR sin (¥ — ) represents the 
area of the parallelogram determined by the vectors ¢ and z and that 


R? + 7? — 2rR os (Wy — ¢) = [¢ — 8)? [319] 
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is the square of the distance between the points ¢ and z. It may, therefore, 
be concluded that the formulas given by Eqs. 317 and 318 are independent 
of the location of the center of the circle C, although in the above deriva- 
tion the center is for convenience chosen to coincide with the origin of 
the z-plane. 

If the function f(z) is written more explicitly as 


f(z) = u(7,6) + jo(r,¢) | [320] 


and Eqs. 317 and 318 are separated into their real and imaginary parts, 
one obtains the following two pairs of relations respectively: 


wird) = wo) +2 [EEO ey ay [32t] 


1 rR sin (Y — 4) 


v(r,6) = v(0) ~ oa ar ae u(Ry) dy [322] 
and 
2a R2 
u(r) = 5- e funn dy [323] 
rd) =f RM [324] 


in which the relation 319 is used for the sake of abbreviation. It is ob- 
served that the formulas in each pair become interchanged when wu is 
replaced by v and v by —w. 

The results expressed by Eqs. 321 and 322 show that the real and 
imaginary parts, « and 2», of a function of a complex variable are, except 
for an additive constant, explicitly related to each other. The real part 
determines the imaginary part, or vice versa. Hence, given either part, 
the corresponding complex function may be found. The formulas 321 
and 322 are said to yield the conjugate potential function to any given 
potential function or to transform one such function into its conjugate 
mate. In this sense they are sometimes referred to as a pair of transforms. 

The integrals 323 and 324 likewise form a pair, but they do not ex- 
press one potential function in terms of the other. Instead, they yield 
the real and imaginary parts of a complex function at any point within 
the given circle in terms of their respective values on the boundary. If a 
potential function is known to be regular for all points on a circular 
boundary and within the enclosed region, it is there uniquely determined 
in terms of its boundary values by means of the formula 323 or 324. 

For r = 0, these integrals yield 


“(0) = _ I “u(Rw) ap [325] 
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and 
(0) = 5- [" oy) a (826) 


These results state that the value of either potential function at the 
center of the circle is equal to the average of its values on the circular 
boundary. As a consequence it follows that the largest and smallest of 
the values which and v assume throughout a circular regularity region 
must occur on the boundary of that region, for if such an extremum 
occurred at some internal point, the conditions stated by Eqs. 325 and 
326 could not be fulfilled for a small circle with its center at this point and 
its boundary within the original one. 

These conclusions yield a useful theorem to the effect that if a function . 
of a complex variable is regular over a given region, the maximum and 
minimum values of its real and imaginary parts must for that region occur 
on the boundary.* The region evidently is not restricted to be circular in 
form since the above reasoning is equally applicable when the boundary 
of the regularity region is arbitrary. 

The formulas 321, 322, 323, and 324 are known as Poisson’s integrals. 
By a combination of Eqs. 322 and 323 a further result of practical utility 
is obtained. The first step is to form 


u(r.) + jo(r,6) 


1 2* R? — r? —72rR sin (p 
= jox0) + = J, BER RW) dp [327] 


From Egs. 311 and 312 it is seen that 


c+e Ret rel? R= — 72rR sin (W — 4) [328] 
t—2 Re¥—reit R? +7? — 2rR cos (vy — ¢) 


so that Eq. 327 may be written 


; 1 i 
ft) = j00) +52 eu) ay [329] 


By means of the series representation 


ftFo142(24+5 se) [330] 


*This result, although similar to that stated by the principle of the maximum modulus, 
should not be confused with the latter. As pointed out in the previous article, the modulus 
a/v + 2% attains its minimum as well as its maximum value on the boundary of the region 
of analyticity only if the function has no zeros within this region. That is, if the function has 
zeros there, the modulus attains its maximum value, but not its minimum value, on the 
boundary. 
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and Eq. 325, this result may be rewritten in the form 


fire) = 70) +2 [" (2+ e +---)u(Rw) de (331) 


The series in Eq. 330 is uniformly convergent for 


3 < 1, so that the in- 
tegration in Eq. 331 may be carried out term by term. When this is done, 
with ‘ 
1 Qa . . 
On = — ff u(Ry)e"¥ dp [332] 
WT 


it is seen that the result given by Eq. 329 may be expressed in the alterna- 
tive form 


f(r,6) = f(0) + = (FY ancint [333] 


Fic. 22. Limiting process which converts Poisson’s integrals into Hilbert transforms, 


By means of either Eq. 329 or Eq. 333, the complex function is deter- 
mined within the circular region in terms of the values of its real part on 
the boundary. 

Other forms for some of these results, more appropriate to the condi- 
tions encountered in electric circuit theory, are obtained through assum- 
ing that the circle in the preceding derivations is an extremely large one 
lying in the right half of the z-plane, tangent to the y-axis at the origin. 
If this circle is imagined to become infinite in diameter, the entire right 
half plane will constitute the enclosed region and the imaginary axis 
(or y-axis) will become the boundary, which then closes upon itself by 
passing through the point at infinity. The manner in which the integrals 
317 and 318 are to be interpreted in this limiting case is clarified some- 
what when the situation is pictured in Fig. 22. The center of the circle of 
Fig. 21 is imagined to lie on the real axis at a point infinitely far to the 
right. The point ¢, which in Fig. 21 is a point on the circular boundary, 
becomes the point jy on the imaginary axis, and the point 2 is character- 
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ized in terms of the variables « and y of the plane of Fig. 22 as the point 
x+y. ; ’ 

In order to evaluate the forms which the integrals 317 and'318 assume 
when this limiting process is carried out, one should observe that the 
appropriate transition may be indicated symbolically as follows: 


rsin (¥ — ¢) > (vy — 9) [334] 
R? — 72 — R? — (R — x)? > 2Rx [335] 
and 
|¢ — 2? >? + (2)? [336] 
whereas 
Rdy > —dn [337] 


in which the minus sign appears because the positive direction of traversal 
of the circular boundary in Fig: 21 is counterclockwise and in Fig. 22 
this corresponds to the negative direction for the y-axis. Finally it should 
be observed that the limits of integration corresponding to a counter- 
clockwise traversal of the circle are from + © to — ~, or from —~ to 
+ if the algebraic sign of the integral is reversed. 

When these substitutions are made and f(z) is assumed to vanish at 
infinity,* the integrals 317 and 318 become in the limit 


= (y — afin) a 


«x? + (y— 7)? [338] 


: 1 
f(z) = u(x,y) + jo(%,y) - 
and 


. 1? «f(jn) 

f@) =uley) tie) =f a Petes 1599] 

The transformation of the Poisson integrals, which in their usual form 
apply to a circularly bounded region, to the forms given by Eqs. 338 and 
339, which apply to the right half plane, may be carried out in a some- 
what more satisfactory manner (than the heuristic one just given) by 
use of a linear fractional transformation (discussed in Art. 24) which 
effects the mapping of the interior of a circle about the origin upon the 


right half plane. 
From Art. 24, one finds that the transformation which reads 
2 —a@ . 
ae rar [340] 


transforms the interior of the unit circle in the z-plane into the right half 


*The validity of the integral in Eq. 338 is restricted to this condition. The term f (0) in 
Eq. 317 becomes f( 0 )} and drops out. : 
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of the z/-plane, with the origin of the z-plane corresponding to the point 
z’ = aon the real axis of the z’-plane. As the unit circle in the z-plane i is 
traversed in the counterclockwise direction, the imaginary axis in the 
z'-plane is traversed throughout its entire extent from +7 to —jo 
(that is, in the negative direction). 

If z is some point within the unit circle, 2’ is a point within the right 
half plane. Toa point ¢ on the unit circle in the z-plane there corresponds 
a point ¢’ on the imaginary axis of the z’-plane which likewise is deter- 
mined from the transformation 340, that is, 

v-a —a 
f= Arar [341] 

The desired transformation of the Poisson integrals is carried out 
through returning to Eq. 309 and introducing there the change of variable 
expressed by Eqs. 340 and 341. One finds from a simple calculation that 


_f$ _ =a)’ +2) 


t-—3 2a(¢’ — 2’) [342] 


and 


dg 2a dg’ 
gs ale’ +a) 
The integrand appearing in the integral of Eq. 309, therefore, becomes 


{« — a)(2' + a) —  — ale a S(g’) dg’ 
ge! 3! gc’ — a)(g’ +a) 


It should now be observed that one wishes to have the origin of the 
z-plane correspond to a point in the z’-plane which is infinitely remote 
from the origin, because the value of the function at the origin in the 
z-plane (the quantity f(0) in Eq. 309) is then carried over into the value 
of the function at infinity in the z’-plane. Under the assumption that the 
function vanishes at infinity (which must, of course, be met by any 
physical problem to which the resulting formulas are applied), the term 
involving f(0) in Eq. 309 then drops out. 

Although it is not possible to determine a linear fractional transforma- 
_ tion which maps the interior of a circle about the origin upon the right 
half plane in such a way as to make the origin or the center of the circle 
correspond to the point at infinity (this would require an infinitely large 
value for the quantity a in Eq. 340), the desired end may be achieved in 
the present problem by considering the limiting process indicated by 
a— to be applied to the expression 344. The result reads 


1 1 ty gpl 
oF S5 a7} f(g") a [345] 


[343] 


[344] 
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Substituting this result for the integrand appearing in Eq. 309, dis- 
carding the term (0) for reasons already stated, and noting that the 
algebraic sign of the integral is reversed if the integration is extended over 
the imaginary axis of the 2’-plane in the positive direction, one has after 
dropping the primes on the quantities z and ¢ 


w= pf fares noe 6] 


which is the desired result. It is readily brought into the more explicit 
form given by Eqs. 338 and 339 through writing 2 = « + jy for any fixed 
point in the right half plane and ¢ = jy for the variable point of integra- 
tion along the boundary or imaginary axis. The two algebraic signs 
appearing in the integrand then yield respectively the results 


_1 ¢° (afin) 
F(z) Se = ae d [347] 
and 


10) =f" oe oe [348] 


which agree with Eqs. 338 and 339. 
Separating real and imaginary parts yields the pairs 


1 7 (y= 1)vO,n) 


wey)== fate en [349] 

vey) = — 2 f Gene, [350] 
and 

weg) <2 = On an [351] 

v(ay) = : f : nen ay [352] 


The last two integrals determine the potential functions in the right 
half plane in terms of their respective values on the imaginary axis, which 
is regarded as the boundary of the right half plane. The individual rela- 
tions in the pair 349 and 350, on the other hand, may be used to determine 
either of the potential functions in the right half plane in terms of the 
boundary values of the conjugate function. A potential function along 
the boundary is given in terms of the conjugate function by either of the 
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integrals 349 or 350 for x = 0. One thus obtains the pair of relations 


v(m) dn 
--f[" aoe [353] 
and 
u(n) dn 
-if" a [354] 


which are known as Hilbert transforms. 

Because of the singularity of the integrand at the point 7 = y, the 
value of either of these integrals, in the ordinary sense, does not exist. 
It is, however, possible to overcome this difficulty* by the definition of a 
particular process of evaluation yielding the so-called Cauchy principal 
value. This value is obtained through approaching the point » = y 
symmetrically from both sides, as indicated for the integral 353 in the 
following expression: 


= o(n)dn_,, aI -* a(n) dn ~ 20) 69) 
Lag mim J, ea mee (355) 


In order to see that this limit has a definite value, one may represent 
the function v(y) as 


v(m) = vy) + (n — y) - POA) [356] 
in which p(y,n) is regular in the eae of the pu n = y. Then 


ia. v(n) dn m 


The principal value of the first of the two es on the right-hand side 
is obtained through the following steps: 


Ca ee Oe 


[358] 


[357] 


— 7 


When this is written in the form 
ly +e ” 
[»o-»f"-[mo-2)| [359] 
ye -—«° 
the limiting process is found to yield 


ws [n(=)]- me [aG2)]-0 


*See E. C. Titchmarsh, “ Conjugate Trigonometric Integrals,” Proceedings of the London 
Mathematical Society, Ind series, 24 (1926) 109-130; “ On Conjugate Functions,” ibidem, 
29 (1929) 49-81. 
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Hence only the second integral on the right-hand side of Eq. 357 remains. 
According to Eq. 356, the integrand p(y,) of this second integral is 
observed to decrease no faster than 1/7 for large values of » regardless of 
whether »(n) vanishes or remains finite for 7 — ~. Although it is true 
that the integral from zero to infinity of a function which decreases no 
faster than 1/n for large 7 does not remain finite, it is essential in the 
present instance to observe that if the limits of the integral are from — © 
to », and the integrand is an odd function about the point 7 = ~, one 
can appreciate in a general way how a finite value can result by noting 
that the contributions to the integral on the positive and negative sides 
of the point » have opposite algebraic signs. This is the same sort of 
reasoning as that involved in deriving the finite principal value 355. 

By reference to Eq. 356 it is seen that if v(m) vanishes for 7 > , then 
b(n) is an odd function about the point » = ©; but if v(m) remains 
finite in this vicinity, p(y,n) is odd about 7 = © only if v(m) is an even 
function; that is, if v(—1) = (7). Hence one is led to the conclusion that 
the given potential function v() either must vanish for 7—> © or must 
be an even function of 4. Inasmuch as an odd function about « must 
necessarily be either zero or infinite at infinity, it is sufficient to state 
merely that the potential function »(7) shall remain regular in the 
vicinity of 7 = «.* Since all the above discussion applies equally to the 
evaluation of the integral 354, the same final comment applies also to 
the conjugate function “(7). 

It is thus seen that the transforms 353 and 354 may be written in the 


alternative form . 
v(n) — vy) 20) ay . 
2 ae [361] 
--f" uy) = 40) 5, [362] 
yu 


Here the functions v(7) — v(y) and u(y) — u(y) vanish in the vicinity 
of the point 7 = y at least as strongly as the factor (y — 7), so that there 
can be no question as to the finiteness of the integrals so far as the 
presence of the factor (y ~ 7) in the denominator of the integrand is 
concerned. 

In the evaluation of the integrals 351 and 352 for the boundary « = 0, 
a difficulty arises due to the fact that the integrand vanishes everywhere 
with the possible exception of the point 7 = y. Yet it must be true that, 
in the limit x — 0, the integrals yield the values «(0,y) and v(0,y). That 
they do so, may be shown in the following way. 


“For a more thorough discussion of the question of convergence of the Hilbert transforms 
the reader is referred to the article by E. C. Titchmarsh already cited. 
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Since the only possible contribution to the value of the integral in the 
limit x + 0 must come from the immediate vicinity of the point 7 = y, 
it is clear that for example, the function «(0,n), in Eq. 351, may be re- 
placed by its value «(0,y) at the point 7 = y, and since it then has 
nothing to do with the process of integration it may be placed in front of 
the integral sign. It then remains to show that ; 


ON of" d 
Hop) = int [seo] BO 


or that 
fl (2% «dn arr 2) 
timit {2 "= 5 = timit| * tan ee =A [364] 


which is evidently true. 

To return to the integrals 353 and 354 (or the equivalent ones given by 
Eqs. 361 and 362), it is useful to observe that if v(m) is an odd function 
of n, then u() is an even function, and vice versa. To see this, one may 
write the integral 353, for example, as 


_1 f° wa)dn | 1 f° 0(n) dn 
wy) =f one += f = [365] 
or 
1 f-° d 1 d 
wy) = — 2 fan Spo [366] 


If v(m) is assumed to be odd, replacing the variable of integration 7 in the 
first of the above two integrals by —7 shows that 


_1 (7? ra)dn_ 1 f- ete) dn [367] 
w y- 4 rJo ytn 
Hence Eq. 366 becomes 
-{" 1 1 ) 
=-— - v(n) ad 368 
u(y) ot GS yaa (1) dn [368] 
or 
_ _ 2 £2? _n(n) 
u(y) = — =f 7-9 dn [369] 
From this form it is immediately evident that «(y) is an even function 
of y. 


By means of an entirely parallel process, one may transform the 
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integral 354 with «(n) assumed to be even and obtain 
_ 2y f u(n) 
vy) = Pn dn [370] 


from which it is apparent that v(y) is an odd function of y. 

It is to be observed that the transforms 369 and 370 are no longer 
general but apply only to cases in which w is even and v odd, as is assumed 
in their derivation. This restricted character of the forms 369 and 370 is 
recognized at once from the fact that these integrals no longer become 
interchanged when wu is replaced by —v and v by —w. 

One might just as well have derived special forms for the opposite 
assumptions, namely that u be odd and v even. However, for the rational 
functions and many others met in practical problems, the real part is 
even and the imaginary part is odd when regarded as a function of y for 
x = 0, that is, for points on the imaginary axis. The following more 
detailed discussion pertains to this important special case. 

If the same method of transformation is applied to the equivalent 
forms 361 and 362, the results obtained are 


28 = 
iQy= ‘ft wn) WOES WO) 5, [371] 
Py 
and 
2 ou —4u 
oly) = 2 2 = s FF [372] 
The last of these is the same as 370 except that (7) — u(y) replaces 
u(n).* 


It should be mentioned shat the even function u(y) is determined by 
the integrals 369 or 371 except for an arbitrary additive constant. How- 
ever, the difference between any two values of u(y) for specified y-values 
must be uniquely determined. For example, using the integral 369, one 
finds 


ue) = 40) =2 [792 [373] 


which is a simple expression for the difference between the values of u 
at the two extremes in the range of y-values. 

This last expression may be put into a practically more useful form 
through introducing the change of variable indicated by 


6 =In (2) ‘Qae [374] 
Yo n 


*If a given potential function is constant, the conjugate function is zero. Therefore, the 
conjugate of #() + constant is the same as the conjugate of u(y). 
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which amounts to introducing a logarithmic scale in place of the linear 
n- or y-scale and choosing the arbitrary point 7 = yo as the new origin. 
If v(@) represents the function »(y) with respect to the logarithmic 
6-scale (that is, v(6) is v(m) plotted on a logarithmic scale), the relation 
373 takes the form 


a\woyre ff" »(0) do [375] 
or 


SE 9) 6 = © [u(2) — wo) [376] 


in.which u(« ) and «(0) still denote the values of u for y = © and y = 0 
respectively.* 

This result states that the net area under the curve for the function 2, 
plotted on a logarithmic scale, depends only upon the difference between 
the values of the corresponding u-function at the extremes of the y-scale. 
Although this result is only a partial statement of the implicit relation 
between the conjugate potential functions and 2, its simplicity makes it 
particularly useful in connection with the practical problem discussed in 
the references just cited. 

A comparable result obtainable from the integral 370 reads. 


limit bo)] = 2 [wea dn [377] 


Another useful particular relation between « and v for points on the 
imaginary axis ({ = jn, = jy) may be obtained from the integral 372. 
The latter may be rewritten in the form 


1 f° u(n) — uy) , dn 


u(y) = a7 ae 72) 7 [378] 
2 ” 
Introducing the change of variable 
pein () dp = 2 [379] 
0 
and writing «(@) for the function u(y) in terms of the new logarithmic 
variable @ give 
1 f° uo) ~ u(0) 
10) = at i sinh @ as [380] 


*This particular form as well as the ones given by Eqs. 377, 380 and 393, were obtained by 
H. W. Bode. See his U.S. Patent 2,123,178; also “ Feedback Amplifier Design,” Bell System 
Technical Journal, 19 (July 1940), 421-454. 
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The integrand in this integral remains regular throughout the entire 
range —«© <6 < o. At the limits 9 = +, u(6) has the finite values 
of u(y) for 7 = © and y = 0 respectively, aieress 2 sinh@— +e'®l, al 
so that the integrand is seen to behave like e~'*! for large positive or 
negative values of 6. For the vicinity of the point 6 = 0, 


u(0) — (0) 0: (2) | [381] 


and 
sinh 6 & 6 [382] 
so that the integrand there maintains a finite value. 


The integral 380 may be transformed into a more useful form ‘through 
the method of integration by parts, which reads 


f2ra=ta- faa [383] 
Letting 
p = u(b) — u(0) [384] 
and 
do 
dq re) [385] 
one has 
du 
p= (=) ds [386] 
and 
6 6 
g = Intanh 5 = — Incoth; [387] 


Thus Eq. 380 becomes 


o(y) = 2[ {u() — u(0)} In tame s+ 1 7 (2) an cotn la 


” 1388] 


in which the absolute value of the variable @ must be used because the 
integral 380 has a real value and hence the argument of the logarithm 
must remain positive. Since the quantity «(@) — «(0) appearing in the 
first term of Eq. 388 remains finite at the limits @ = -+., the value 
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of this term is zero at both limits. Its values throughout the range 
—«o <@< ©, moreover, are finite, since at the critical point 6 = 0 it 
has the value 


du 6 
(4) 6-In 3 7 0 [389] 
This term, therefore, vanishes and there remains | 
_! a 
f- =) im coth Wha [390] 


The factor In coth appearing in the integrand has a logarithmic 


infinity at the point 6 = 0 and is a symmetrical function about this point, 
dropping off rather rapidly on both sides. Since 6 = 0 corresponds to 
1 = 4, it is seen that the value of the function v(y) at any point is largely 
determined by the slope of the conjugate function u(y) at that point 
_ when plotted on a logarithmic scale. 

This interpretation of the result expressed by Eq. 390 may be clarified 
through writing 


“\ du dy 4) *) 

=y(“) =(% 391 
doug \dn d0)ray >” \dn)y-y  \d8)o [391] 
and noting that* 


a i im coth lw=2 f° In coth = Sa = | [392] 


Then, by adding and Se (du/d0) to a d@) in the integrand of 
Eq. 390, one obtains the form 


o(y) = *(@) + : f ‘ (=) - (=)} in coth Hl ay [393] 


Here the first term represents the major contribution to the function 
v(y). If u(6) is a symmetrical function about a given point 7 = ¥y (that is, 
@ = 0), then (du/d9) — (du/d8)o i is an odd function about this point and 
(because In coth [|@|/2] is even about 6 = 0) the contribution of the 
integral in Eq. 393 becomes zero. Its value in any case is seen to depend 


*Making the change of variable x = coth @/2, one finds 
ies 0 @Inxdxs x 
uA ma >. J i-2 2 


which is No. 3 of Table 187 in Bierens de Haan, Nouvelles tables d’intégrales définies (Engels, 
Leiden), 1867. 
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only upon the asymmetry of the function ~(6) about the point 9 = 0, and 
because of the critical character of the factor In coth (|6|/2), only the 
asymmetry in the more immediate vicinity of this point can have an 
appreciable effect. 

Although the computational labor involved in the solution of many 
problems of this sort may be quite heavy, the necessary theoretical basis 
for such a solution is evidently afforded by the Hilbert transforms or by 
modifications of these. However, when certain aspects of electric circuit 
theory are dealt with, a somewhat more complicated version of the 
problem presents itself. Instead of either u(y) or v(y) being specified over 
the entire range 0 < y < ~, u(y) is specified only over portions of this 
range and v(y) is specified over the remainder. For example, «(y) may be 
specified for 0 < y < yo and v(y) for vo < y < ©. The problem is to 
determine u(y) and v(y) for the ranges in which they are not specified,* 
that is, the object again is to determine the whole complex function over 
the entire range 0 < y < o. 

The clue to the method of solution lies in dividing the complex function 
u(y) + jv(y) by the irrational factor V1 — y?/yo?, which is real over the 
range 0 < y < yo and imaginary over the complementary range yo < 
y < o. Hence the resulting function reads 


107 a ; oe = for0<y<y40 [394] 
ie ae Foie 
: Yo" Yo" 
and 
vy) . u(y) 


ae JG ~ for cy <n [395] 

Ve-1 V5 

og = I a = 1 

Yo Yo 
The significant part about this result is that the real and imaginary 
parts of the resulting function alternately involve « and v in the two 
ranges. The same effect may, of course, be obtained if the complex func- 
tion w + jv is multiplied instead of divided by the irrational factor, but 
the resulting real and imaginary parts might then no longer remain finite 
in the vicinity of the point y = o. Also, one could presumably find other 
irrational functions which are alternately real and imaginary in the 
ranges 0 < y < yo and yo < y < , but the one used above is perhaps 


the simplest. 
Substituting the real and imaginary parts of the functions 394 and 


*The solution to this variation of the problem was obtained by H. W. Bode, Joc. cit. 
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395 for u(y) and v(y) in the transforms 369 and 370, one finds respectively 


nal. 0(n) dn 
vis n? 
bo Gt Sa ve 


ur for 0 < y < yo 
. ~ 2 
42 a WOO ce Yo [396] 
sf 2 2) {2 v(y) 
(n? — y*)a{—s — 1 == fory<y< ao 
Yo Yo y 
—-1 
Yo 
and 
2y rv u(n) dy 
sa (7? eft 
0 n ¥ yo? 
ard for 0 < y < yo 
y 
1--; 
- 2 
+2 a Sy . [397] 
(ot = 9) - 1 ee) ee ee 
7) Yo yy 
ay =i 
. ¥Y¥o 


Equation 397 is used when “(y) is specified over the range 0 < y < yo 
and v(y) over the range yo < y < ©; Eq. 396 applies when the reverse 
is true. As an illustrative example, one may suppose that «(y) is specified 
to be constant throughout the range 0 < y < yo and that v(y) is constant 
over the complementary range. Inasmuch as u(y) is, by the present 
methods, determined within an arbitrary constant, one may simplify 
these given data somewhat by assuming #(y) to be zero over the range 
0 < y < yo and adding the desired constant value to the whole solution 
afterward. Thus one has for the given data 

u = r 0 
(vy) =0 for0<y<y [398] 
v(y) = % fory<y< 


When Eq. 397 is applied, the first integral drops out because w = 0. 
The remaining expressions are somewhat simplified by the substitutions 


n ¥ 
£=— == 399 
Yo oe Yo [399] 
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One then has 
v(Xo) : 
—==——  for0<x% <1 
2X f - dx 2 V1 — xo? [400] 
lea ee ho 0%) for 1 <x < 
V Xo" —1 . 


The integration may be accomplished through noting first that 
i f Ree RS f bg 
oA (x? — ao?)V a? —1 0 241 (e — 49) V x? — 1 


1 f dx 
_- {| ————— (401 
241 (x + x9)V x? — 1 [401] 
Using the substitution 


a XxXq — 1 
OF eee es [402] 
one finds 
dz 3 LS Xo? dx [403] 
VI-—- #2 (x —%)Vx?—1 
whereas with — 
anions [404] 
wet Xo 
there results 
—_ a 2 
dz V1 = xo? dx [405] 


Vi-# @+n)Ve—1 


The changes in the limits of integration which accompany the substitu- 
tions 402 and 404 being noted, the integral on the left-hand side of Eq. 
401 is found to be given by 


walle TAS] [406] 


The integrands in these integrals are even functions. Hence the integral 
from 1 to x) may be replaced by one from —9 to —1. The two integrals 
in the expression 406 may then be combined, giving 


2vpXo f dx Z Vo f dz [407] 
T (x? — x97)Va7 -~ 10 eV 1 — xp? ¥-m V1 — 2? 
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The integration is now no longer difficult. Equation 400 is thus observed 
to yield for 0 < % <1 j 


v(%o) = 28 sin! x9 = a In (V1 = 29” + jxo) [408] 


whereas for 1 < x < ©, in which range it is appropriate to consider the 
right-hand side of Eq. 407 in the form 


—p 0 dz 
Ss ———— 409 
TV Xo" —1 L 2 — 1 [ ] 
the integration yields the real part 
2 
u (i) = In (Wao? = T + 2) [410] 


One readily recognizes from these results that the complete complex 
function is given over the entire range 0 < y < © by the expression 


2 7 j 
u(y) + jo) = Mm (ft - 5 +22) [411] 


The procedure may readily be extended to problems in which the range 
0 <y < © is divided into more than two subranges. For example, if u 
and v are alternately specified in the ranges O<y<yw,y<y< V2, 
yo <y < ~, the Hilbert transforms are applied to the real and imagi- 
nary parts of the function (« + jv) divided by V (1 — y?/y12) (1 — y?/y_2). 
The determination of the appropriate detailed relationships follows the 
same pattern as given above for the simpler problem involving two sub- 
ranges. 


23. MORE ABOUT POTENTIAL THEORY AND CONJUGATE FUNCTIONS 


Additional useful physical interpretations relevant to properties of 
functions of a complex variable may be had from pursuing further the 
intimate relationships between certain of these properties and potential 
theory. The starting point for the present discussion is the simple physical 
situation shown in Fig. 23. The plane of the paper, which is also regarded 
as the complex z-plane, represents a plane perpendicular to an infinitely 
long straight conductor. The origin of co-ordinates for this plane is chosen 
coincident with the conductor, which is assumed to carry a uniform linear 
distribution of charge of g coulombs per unit length and also a uniform 
current of ¢ amperes whose reference direction points upward. . 

According to elementary theory, the electric field intensity 6 and the 
magnetic flux density B at the point zo are given by the negative gradient 
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of corresponding potential functions V and F, that is, 
&=—grad0 B= — grad F [412] 


with 
2 1 
0 =—In- [413] 
€ r 
and 
Fic. 23, The plane perpen- = —2yip [414] 
dicular to an infinitely lon; . ni ; Seger 
straight charged ae ted in which € and ,» are the dielectric permittivity 
carrying current. and magnetic permeability respectively of the 


homogeneous isotropic surrounding medium.* 
The field components in polar co-ordinates are 


30 x0) 

an ee rrr [415] 
oF oF 

B, = — 3 Be — Tae [416] 


—_ a 5 &= [417] 


B,=0; B= | [418] 


It is now useful to observe that if one defines a single complex potential 
function 


0 FL ott 
w= 2q +Ioq= In re ahs [419] 
in which 
Zo = rei? [420] 


the field components are determined from 


aw e&, . By, 
~ er 2g 19 Oni [421] 


and 
dw = 2&4 ° By 
r 0d 2q Qui 
*Unrationalized MKS units are used in these discussions. 


[422] 


Art. 23] POTENTIAL THEORY AND CONJUGATE FUNCTIONS 351 


or the resultant fields from the relation 
| B 
Qui 
The real and imaginary parts of the gradient of the complex potential 
function w yield the electric and magnetic fields respectively (except for 
constant multipliers). 


It is thus seen that the real and imaginary parts u and v of the simple 
function 


& 
~ grad w= 5 +9 [423] 


oxen ees [424] 
20 

when multiplied respectively by the constant factors 2g/e and 2yi are 
the scalar electric and magnetic potential func- 
tions for the single ‘charge and current-carrying 
conductor of Fig. 23. This example, incidentally, 
affords an interesting physical interpretation of the 
multivalued character of the logarithm function. 
Since, for the scalar magnetic potential F, the 
radial lines are equipotential loci, this function 
changes continuously in the same direction as 
one proceeds along a path encircling the con- 
ductor, each complete revolution yielding an 
increment proportional to 27. If F is likened to Fic. 24. The plane 
an altitude function in a physical terrain, the Perpendicular to a pair 
similarity of the Riemann surface to a winding oe ee ny ease! 
ramp is realized. The branch point about which jeg citrent in Sopoatie 
the surface winds becomes, in terms of this mag- directions and carrying 
netic field interpretation, a vortex thread which is opposite charges. 
physically realized by the current-carrying con- 
ductor. This singularity for the field and the function representing it is 
indeed the seat of its excitation. 

A pair of identical conductors are now considered. They are spaced 
symmetrically with respect to the origin as shown in Fig. 24 and are 
assumed to carry charges and currents of equal magnitude but opposite 
sign. The complex potential function in this case is, with reference to 
the notation indicated in the figure, 


1 1 


eG ea [425] 
or 
w =In (2+*) - [426] 
40 —~ @ 
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It is now assumed that the spacing 2a between the conductors is very 
small compared with the distance to any point zo at which the field (or 
the resulting function) is to be studied. For the moment this spacing 
may be regarded as being finite, although for some of the subsequent 
discussions it is more appropriate to allow the limiting process a — 0 to 
be completed. At all events the condition 


a 
<1 427 
Fs a 
is assumed to be fulfilled, so that Eq. 426 is replaceable by 
woxin(1 +2) 2% [428] 
20 20 


with any accuracy that may become necessary. This form for w may be 
obtained through expanding the logarithm in a Maclaurin series in terms 
of the variable 2a/zo and retaining only the first term, or it may be derived 
in a somewhat more lucid fashion through first writing the right-hand 
side of Eq. 425 in terms of the defining integral for the logarithm, thus 


+a q —a g 
w =In (% +4) —In @—a) =f =- f° 7 [429] 


which is equivalent to 

tad 
maf 
If the condition 427 is fulfilled, then the points z + @ and z — @ are so 
close together that this integral is very nearly given by 
1 Zzo+a 2a 


w= 
0 Vzo—a 20 


w= [430] 


[431] 


which agrees with Eq. 428. 

As stated above, the conductors in Fig. 24 also carry currents i having 
equal magnitudes and opposite directions (upward from the plane of the 
paper for the conductor carrying +g) so that the real and imaginary 
parts of w again represent scalar electric and magnetic potential functions 
when multiplied by their appropriate factors. These, incidentally, may 
just as well be assumed to be numerically equal, as expressed by 


q 


3 = ee (numerically ) [432] 


Then the common factor may be included in the expression for w. Since 
this procedure facilitates matters somewhat, it is assumed for the follow- 
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ing discussion that 


ey [433] 
&Zo 
It is now possible to let the spacing 2a become as small as desired 
without having the magnitude of w become small at the same time, since 
one can demand that as 2a is made smaller and smaller, the charge 
be made correspondingly larger so that the product 


aq =m [434] 


remains constant. In the limit a—0, g approaches infinity. This limiting 
configuration of oppositely charged conductors is a useful concept and 
is referred to as a dipole or doublet. The quantity m, called the moment of 
the dipole, is considered to be a vector coincident with the line joining 
the two charges (the axis of the dipole) and directed from the negative to 
the positive charge. The vector character of this dipole moment is taken 
into account through its being considered a complex number although 
in the present example it is real, as is evident from inspection of Fig. 24. 
The complex potential function for the dipole is 


we =e [435] 
and the scalar electric and magnetic po- 


tential functions are the real and imagi- 
nary parts 


2m 
Py 43 
“= ~~ cos o [436] 


and 


0= — = sin ¢ [437] 

Fic. 25. A dipole of moment m 

Although in this derivation the dipole with arbitrary position and 
is assumed to have a very particular lo- orientation. 

cation and orientation in the z-plane, 

namely, that shown in Fig. 24, one may readily generalize the expression 

435 so that it becomes appropriate to a dipole in an arbitrary position 

and orientation as is shown in Fig. 25, where the dipole moment makes 

an angle y with the reference axis. The dipole is located at an arbitrary 

point z, so that one now has 


(20 — 2) = re’* [438] 
With m denoting the magnitude of the dipole moment, it is readily seen 
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that the appropriate modification of Eq. 435 reads 


2me?* m , 
2 a BS Id) 439 
e &(Z9 — 2) er ° ey 
which has the real and imaginary parts 
2 
u = — cos (¥ — $) [440] 
er 
and 
am ., 
v= — sin W — ¢) [441] 


It is significant that if the dipole is rotated through 90 degrees in the 
clockwise direction — that is, if y is replaced by ¥ — (4/2) — then u 
becomes replaced by 2, and »v by —w. In other words, rotation of the dipole 
through 90 degrees in the clockwise direction has the effect of replacing 
each potential function by its conjugate. Further discussion of this useful 
observation is given presently. 

The preceding derivations also show that the dipole may be regarded 
as the result of merging two logarithmic singularities of opposite polarity 
and infinite magnitude. As Eq. 439 shows, the end result is a simple pole, 
and the complex residue of the function w in 
this pole, except for the factor 2/e is numer- 
ically equal to the vector moment me’* of 
the dipole. One thus obtains an interesting 
physical interpretation for the kind of sin- 
gularity designated as a simple pole, 
namely, that it may be visualized as the 
seat of an electric or magnetic dipole with 
its moment proportional to the residue of 
the function in this pole. , 
Fic. 26. A 2n-pole centered at It is logical to continue with a similar 
the origin. The position of + study of poles of a higher order of multi- 
and — charges is correlated to pjicity. As the following discussion shows, 
the form of the resultant com- th ted hy thi A f 

plex potential function. ese are represented by the merging 0 

symmetrical configurations of larger num- 
bers of charges having equal (infinite) magnitudes but signs which are al- 
ternately positive and negative, there being an equal number of each. For 
example, the configuration following the dipole in its order of complexity 
is the so-called quadripole, consisting of two positive and two negative 
charges arranged alternately and spaced symmetrically on the circum- 
ference of a (vanishingly) small circle of radius a. Next in the order 
of complexity is the configuration involving six charges of which three are 
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positive and three negative, and so on. In general such a configuration is 
referred to as a 2n-pole, m being the number of positive or negative 
charges. The dipole is a 2n-pole form = 1; the quadripole is a 2n-pole for 
n = 2, and so forth. 

Figure 26 shows a 2n-pole for » = 6 with its center at the origin of the 
z-plane. The points 21, 22, -- + Zan are uniformly spaced on the circum- 
ference of the small circle, the even-numbered ones corresponding to 
positive charges and the odd-numbered ones to negative charges. The 
resultant complex potential function is given by 


w = In : +1n $e tin 
— 24 


€ 20 — 2 Zo 20 — Zan 
1 
—In —In ! —+++—In : | [442] 
2 — & 20 — 23 20 — Z2n-1 
or 
wa 2€n (29 = 21) (Zo se 23) ae (9 —_ Zon—1) [443] 


€ (Z — %2)(%0 — 24)--- (20 — Zen) 
Now it is recognized that the values of 1, 23, - + - Zon1 are the roots of 
the equation 
a +a"=0. [444] 
which are ; 
z= aV=1 = ae = (y = 1,3, 5,---2n—-1) [445] 


and the values of 2g, 24, - + Zan are the roots of the equation 


2” —a"=0 [446] 
which are 
B= aV1 = ae”) — (y = 2, 4, 6, - +» Qn) [447] 
Consequently, 
(20 ~ 21) (Zo — 23) +++ @o — Zen-1) = %" + a” [448] 
and . 
(0 ~ Z2) (Zo — 24) +++ (Zo — Zan) = %" — a” [449] 
so that Eq. 443 becomes 
oo tin(ts) 0 
Again if 
a <1 [451] 


|zo| 
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the logarithm in this expression is very nearly given by 


2q ( 2a 4ga”™ 
ee In —_— J nw 
we 1+ a [452] 


If the moment of the 27-pole is defined as 
m = 2ga” | [453] 


its complex potential function becomes 


a ite = ze = eine [454] 
Y 

f eye which yields the result expressed by Eq. 

435 for 2 = 1. 

When the 2-pole is located at an arbi- 

trary point z and has an arbitrary orienta- 

tion, as shown in Fig. 27, its complex 


potential at the point zo is given by 
2mei”¥ 


w=—— 
& (Zp — 2)” 


[455] 


Fic. 27. A 2n-pole centered at 2. because Z in Eq. 452 becomes replaced 

by (zo — z)e*. Here the angle y is evi- 
dently determined only within an additive positive or negative integer 
multiple of the angle 2x/n. 


If again one writes 


(% — 2) = rei [456] 
when ; 
w= an einte-D [457] 
. & 
so that 
“us a cos n(y — $) [458] 
and 
I= am sin n(y — ¢) [459] 


According to Eq. 455, the 2n-pole evidently represents a pole of 
multiplicity for the function w. 
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It is now of interest to consider a configuration of charges resulting 
when many identical dipoles are placed side by side, with a uniform 
infinitesimal spacing, their centers lying on any curve and their moments 
all pointing toward the same side of this curve. Such an arrangement is 
shown in Fig. 28. If the dimension perpendicular to the plane of the paper, 
in which longitudinal uniformity obtains, is mentally supplied to this 
picture, it is readily recognized that this arrangement may be regarded as 


Fic. 28. The cross section of a sheet having a uniform surface distribution of dipoles. 


a uniform surface distribution of dipoles, and that the result is equivalent 
to having a surface or membrane uniformly covered with a charge of 
equal magnitude but opposite sign on its two sides, the thickness of the 
membrane corresponding to the spacing 2a between the charges in the 
dipoles. 

This configuration is called a uniform double stratum. If the magnitude 
of the surface density of charge is denoted by ¢, and 2a is the thick- 
ness of the membrane, or the separation of the surface charges of 
opposite polarity, 


2ao = T [460] 


is defined as the moment of the double stratum. In terms of the individual 
dipoles with their charges +g, one recognizes that if the uniform spacin 
between the dipoles is denoted by the differential ds, : 


q=ods [461] 
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and hence 
2aqg = m = rds [462] 


The problem is to determine the scalar electric potential « at the point 
&. This may be determined through integrating the expression 440 over 
the surface of the double stratum. Thus, the differential contribution to 
the net potential at zo due to the single dipole at the point z is, with the 
help of Eq. 462, 


27 ds cos (Wy — ¢) 


= [463] 


2m 
du = ~~ cos (W ~ $) = 


But from Fig. 28 it is recognized that 
—r dd = ds cos (Wy — ¢) [464] 


so that Eq. 463 becomes 
du = ~ “as [465] 


Hence one obtains for the net potential at the point zo 
2r {* 2 
w= [du = - = [ae = Fo - 2) [466] 
E Vo, € 


This remarkably simple result states that the potential is proportional 
only to the net angle subtended at the point zo by the boundaries of the 
double stratum. The shape of the double stratum between these bounda- 
ries is immaterial. It is essential to the achievement of this simple result 
that the moment 7 of the double stratum be constant over its entire 
surface. The expression given by Eq. 466 applies only to the potential of a 
uniform double stratum. = 

It is now a simple matter to find the potential function which is the 
conjugate of this potential « of the double stratum. It is pointed out 
above in connection with the single dipole that the conjugate to the 
electric potential function u results when that dipole is rotated through 90 
degrees in the clockwise direction. According to the principle of super- 
position, the conjugate to the double-stratum potential results when all 
the dipoles constituting it are rotated through 90 degrees. This process 
converts the double stratum into a dipole chain. If one chooses to make 
the spacing ds between the dipoles in the double stratum equal to the 
spacing 2a between the charges in the individual dipoles, that is, if 


ds = 2a [467] 
the adjacent positive and negative charges in the dipole chain fall on 


top of one another and hence cancel, so that there is left only one nega- 
tive charge —g at the point z, and one positive charge g at the point 4. 
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The choice indicated by Eq. 467 is, according to Eq. 462, equivalent to 
‘making 


Q=T [468] 

so that the potential function conjugate to « given by Eq. 466 becomes 
27 V2 
=—in2 

v= In 5 [469] 


Since replacing u by v and v by —w again yields a pair of conjugate 
potential functions, one may also write 


u= é In 75 [470] 
and 
2 
v= = (b1 — $2) [art] 
These are evidently the real and imaginary parts of the complex function 
2a (G=*) [472] 
& 2— 22 


in which zo of Fig. 28 is now considered to be any variable point z. 

The response functions of linear passive electrical networks (driving 
point and transfer impedances or admittances) are rational functions 
of the form 
(2 — 21) (2 — 23) +++ (2 — 22m_1) 
2) =O 

fs) (2 — 22)(@ — 24) +++ (@ — Zam) 
In problems of this sort it is expedient to write 
f(z) = ef = eA? [474] 


and have, for the so-called Joss and angle functions A and B, 


[473] 


2 py, @— 2) — %) +++ (@ — Sem1) 
ee es (2 ~ a2)(z — 4) +++ (@ — Bam) [475] 


Thus the loss function A may be regarded as the electric potential result- 
ing when a system of identical charged conductors are placed at the 
points 22, 24, +--+ 22m and a set of conductors with charges of opposite 
sign are placed at the points 21, 23, - + + Z2m—1. The function A is usually 
studied for pure imaginary values of z, that is, for points on the imaginary 
axis in the z-plane (these correspond to real frequencies in steady state 
circuit analysis). B is then the conjugate potential function with respect 
to A. A problem in network design is thus reduced to the determination 
of a distribution of the points 21, 22, 23, : - - which yields suitable functions 
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A and B. The visualization of the problem lent by the analogy to po- 
tential theory can in many cases be helpful. 

If the rational function, Eq. 473, is expanded in partial fractions (as 
shown in Art. 16), each term is recognized as having the form of Eq. 439. 
The real and imaginary parts of this function may, therefore, be in- 
terpreted as the potentials due to dipoles at the points Z2, 24,°+ + Z2m- 
More specifically, the real part (which may be a resistance or con- 
ductance) is the potential of the dipole distribution represented by the 
partial fraction expansion of f(s), whereas the imaginary part (a corre- 
sponding reactance or susceptance) is the conjugate potential resulting 
after all the dipoles are rotated through 90 degrees in the clockwise 
direction. 


24. SoME USEFUL FUNCTIONS IN CONFORMAL MAPPING; THE LINEAR 
FRACTIONAL FUNCTION 


The artifice of introducing a change of independent variable by 
substituting for the given one some function of a new variable usually 
implies the transformation of one region of the complex plane into 
another. A contour in the one region is thereby mapped upon the other. 
The two regions may be mutually exclusive, or partly or wholly over- 
lapping, depending upon the nature of the transforming function. A very 
common substitution is 


1 
aaagrt [476] 
which is more explicitly written 


nitty stak wee aeee 


whence 
7" ~y 
g= Ee’ n= EP [478] 
In polar co-ordinates 
z= rei* [479] 
gs = pe? [480] 


so that the transformation 476 yields 


p= [481] 


bs ed 
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and 
6 = —@ [482] 


Because of the relation 481, this substitution is spoken of as a transforma- 
tion in terms of reciprocal radii. Since p is larger than, equal to, or smaller 
than unity as 7 is respectively smaller than, equal to, or larger than unity, 
it is seen that the region of the z-plane within the unit circle is trans- 
formed into the region outside this circle, and vice versa, whereas the unit 
circle itself remains intact except for a shifting of specific points according 
to the relation 482. In particular, the points +1 on the real axis remain ~ 
fixed by the transformation, and these are consequently spoken of as the 
fixed poinis. 

’ A significant property of the transformation 476 is that the origin 
and the point at infinity become interchanged, that is, z = 0 corresponds 
to ¢ = ©, and z= corresponds to ¢ = 0. Thus a given function f(z) 
may be studied in the vicinity of z = « through introducing the change 
of variable 476 and then studying the resulting function F(¢) = f(1/z) in 
the vicinity of ¢ = 0. ' 

A variation of this transformation reads 


= : [483] 
in which the bar indicates the conjugate value. With 
pF = pte” [484] 
this yields 
tat | [495] 
and 
=> [486] 


All the points on the unit circle are now fixed points. Corresponding points 
z and ¢* may be found by means of the geometrical construction indicated 
in Fig. 29, as may be seen from the fact that the similar right-angle 
triangles yield the relation 

jel _ 04 

OA |z*| 
in which OA is the radius of the unit circle and hence equal to unity. The 
point ¢* is said to be the mage of the point z with respect to the unit circle. 
The transformation indicated by Eq. 483 is, therefore, also spoken of as a 
reflection or inversion with respect to the unit circle. As the point z traces 


[487] 
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some figure inside the unit circle, the corresponding point ¢* maps the 
image of this figure outside the unit circle. 

If the unit circle is replaced by a circle of radius R, the transformation 
reads ¢* = R?/z. The geometrical construction shown in Fig. 29 applies 
to any value of R. If R — |z| is held constant as R is allowed to become 
larger and larger, and attention is focused 
upon that part of Fig. 29 in the vicinity of 
the points z and ¢* alone, the circle will 
appear to degenerate into a straight line 
and the two points in question will ulti- 
mately be symmetrically located with re- 
spect to this line in the sense of an object 
and its mirror image. 

Other variations of this transformation 
which are sometimes used are ¢/ = —¢ 
Fic. 29. The point z and its and ¢* ‘= —s*. All these substitutions 
image ¢* = 1/2 as obtained by have the property of transforming circles 

graphical construction. into circles, as isshown from a more general 
point of view in the subsequent discussion. 

The transformations ¢ = +1/z are, of course, conformal, and hence, 
as shown in Art. 2, angular relationships are preserved in sense as well as 
in magnitude. The transformations ¢* = +1/2 are isogonal but not 
conformal. They preserve angular relationships except for a reversal of 
the positive sense of angular measurement, as in the right- and left-hand 
relationship between an object and its mirror image. For any closed con- 
tour not encircling the origin,* the direction of traversal is preserved by 
the transformations ¢ = +1/z, whereas it is reversed in the transforma- 
tions ¢* = +1/2. These relationships are indicated for a set of circles in 
Fig. 30, in which a typical point P on the given locus z is also indicated in 
each of the transformed loci. 

It is collaterally useful to observe that, for the transformations ¢’ = 
—1/z and ¢* = 1/2, a given locus lying wholly in the upper (respectively 
lower) half plane, remains wholly within that half plane. In these cases, 
the upper (respectively lower) half plane is said to be transformed into 
itself. Similarly, the substitution ¢ = 1/z transforms the right (respec- 
tively left) half plane into itself, whereas (*’ = —1/Z transforms the 
right into the left half plane, and vice versa. In a specific problem the 
particular form of the substitution chosen is thus seen to depend upon the 
detailed requirements set by the nature of that problem and the purpose 
for which the substitution or transformation is made. 


*The conclusions in this statement are reversed if the contour does enclose the origin. 
This fact may readily be appreciated from the study of a few simple examples, 
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The substitutions ¢ = +1/z are special cases of the more ‘Seneral 
linear fractional or bilinear form 


[488] 


Fic. 30. Corresponding circular loci of z and various related reciprocals. Note the 
various senses of angular relationships. 


in which the complex constants a, b, c, d are subject to the condition that 
the so-called determinant ad — bc of the transformation shall not vanish, 
that is, 


ad — be #0 [489] 
and, in addition, 
c#0 [490] 
The function 488 then is regular everywhere except at the one point 


d 
ne [491] 


at which it has a simple pole. 

Inclusive of this point, there is a unique correspondence between all 
points in the z- and ¢-planes, so that the inverse of the function 488 which 
reads 

dg—b 


oS eee [492] 


has identical mapping properties. 
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The condition 490 is necessary in order to prevent 488 from degenerat- 
ing into an ordinary linear form.* The reason for the condition 489 is 
appreciated through consideration of two points ¢, and ¢2 in the ¢-plane 
and their corresponding points 2, and 22 in the z-plane. Equation 488 
then yields 


- (ad — bc)(z — %) 
(czy + d)(czq + d) 
whence it is clear that a unique correspondence of points does not result 
if ad — bc = 0. 
In addition to ¢; and 2, two more points {3 and ¢4 together with their 


corresponding points zg and z, are now considered. By analogy to the 
form of Eq. 493, it is then readily seen that 


$1 — $2 [493] 


fs—f1  & —% ca td 
———— = —— X 494 
fs — $2 %—% cH +d [494] 


and 


bo—-  & — 4 ea +d [49s] 


i — 2 fe en Ot ta 
and hence that 


63 — $1 Sof 84 [a4 [496] 
$3 —- Sef $4 —S2 2% — fof By — &e 


With reference to Fig. 31 it is clear that 


zg —%1 — [zs — 1 ia 
eel SO eae 497 
23 — 22 |z3 = 22 . [ ] 


and 


fa — 1 _ | — al i [498] 

f — 22 [Za — 2a). 
Now if the points 2, 22, 23, 2, are assumed to lie on a circle, it follows 
that a = §. The right-hand side of Eq. 496 is then seen to reduce to a 
positive real constant. Since, according to a familiar geometrical proposi- 
tion, the angles « and 8 can be equal only if the four points in question 
lie on a circle, and since, according to Eq. 496, a similar geometrical 
relationship to that shown in Fig. 31 likewise applies to the quantities 


*The mapping properties of the linear transformation w = az-+ 6 are recognized as 
amounting to a magnification and rotation according to the magnitude and angle of the 
complex constant a, and a translation equal to the value of 6. Any picture or map subjected 
to this transformation obviously remains undistorted. The mapping properties of the linear 
transformation are, therefore, said to be homographic. 
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$1, $2) £3, $4, one may conclude that the corresponding points £1, £2, ts, ¢4 
must also lie on a circle. An important property of the transformation 
488 or its inverse 492 is thus proved, namely, that this linear fractional 
form carries circles over into circles. In other words, a circular locus in the 
z-plane corresponds to a circular locus in the ¢-plane. Straight lines are 
included here since they represent circles with infinite radii, but it is not 
implied, of course, that straight lines are 
necessarily carried over into straight lines. 

The linear fractional transformation is, in- 
cidentally, the most general analytic trans- 
formation yielding a one-to-one correspond- 
ence between all points in the simple w- and 
z-planes. The term “ simple ” w- or z-plane is 
intended to denote such a one in which the 
unique distinction between points does not 
require the concept of a multileaved Rie- 
mann surface. With the interpretative aid of 
such a surface, any analytic function yields ia-ahowing that (he sear 
a one-to-one correspondence between points fractional’. . Granaformation 
in the w- and z-planes. In the simple complex carries circles over into 
plane however (“schlichte Ebene” in the ‘circles. 
German literature), a one-to-one correspond- 
ence between points is afforded only by the so-called schlicht functions, 
of which the linear fractional function is the most general form. 

The fixed points of the transformation are those for which ¢ = z, 
or, Eq. 488 being used, they correspond to the z-values satisfying the 
equation 


Fic. 31. Construction used 


az +b 
a +d ee 
which is equivalent to 
cz2+ d—a)z—b=0 [500] 


The roots of this quadratic equation may be denoted by 2, and zp. Then 
| Si =a and —f_ = Be [501] 


If the £-plane is superimposed upon the z-plane, any circle passing 
through the points 2, and z2 is transformed into another circle passing 
through these same points. Hence the family of circles passing through 

‘the fixed points 2, and 22 is transformed into itself in the sense that any 
one of these circles is transformed into another belonging to the same 
family. Now these circles possess an orthogonal family of loci which are 
also circles. These enclose the points 2, and zg, as shown in Fig. 32. Since 
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the transformation is a conformal one, it follows that the transformation 
of an orthogonal circle must also be orthogonal to any of the circles 
through z; and 22. Hence the family of orthogonal circles is transformed 
into itself also. 


z-plane 


Fic. 32. Circles through the fixed points 2, and ze are transformed into other circles 
through the same two fixed points. 


For proper choices of the constants in Eq. 488, this transformation 
may be made to carry any specified circle in the z-plane over into an 
independently specified circle in the ¢-plane. In order to see this one may 
write by analogy to Eq. 496 


c-& oh fa’ [2-4 


6 — $2 3 — {2 %—22/ 23 — 2 


[502] 


in which 21, 2, 23 are distinct and z is a variable point on a given circle 
in the z-plane, whereas £1, f2, ¢3, and ¢ are the corresponding distinct 
and variable points on the resulting circle in the ¢-plane. If one writes 
for the sake of abbreviation 


§3 — $1 7 23 — 21 
be 503 
§3 — Sef %3 — 2 [503] 
Eq. 502, which is equivalent to the transformation 488, takes the form 
ee ae [504] 


o — $e Z— 22 
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Here 2, 22, 2g are any distinct points determining a specific circle in the 
z-plane, and ¢1, £2, ¢3 are any chosen corresponding points in the ¢-plane 
determining a circle which the desired transformation is intended to 
yield. 

In particular, the points 2; and zp in Eq. 503 may be selected as the 
fixed points of the transformation. Then ¢; = 2; and £2 = 2g, so that 


3 — 24 y, 23 — 21 
= ——_ / —— 505 
§3 — 22/ 23 — 22 © [505] 
Equation 504 then reads 
6-4 z—~ 24 , 
oes ey ee 
ie, Pees [506] 


This is called the normal form of the transformation 488 or its inverse 
492. 

Two special types of this transformation may now be defined. For the 
first of them, any member of the family of circles passing through the 
fixed points 2; and 22 is transformed into itself. This statement means 
that the points 23 and ¢3 lie on the same circle, although they are, of 
course, not coincident like the points ¢; = 2; and £2 = 22; otherwise the 
transformation would degenerate into the trivial identity ¢ = z. The 
points 21, 22, 23, £3 are then four distinct points on the same circle, like 
the four points 21, 22, 23, 24 in Fig. 31. According to the reasoning which 
shows the right-hand side of Eq. 496 to be a- positive real constant, it . 
follows from the similarity between the right-hand sides of Eqs. 496 and 
505 that H is a positive real nonvanishing constant. This then is the condi- 
tion for which the transformation 506 (or its equivalent 488) carries any 
circle through the fixed points z; and 22 over into itself. This type of the 
transformation is designated as hyperbolic. 

The second special type has the property that any one of the orthogonal 
family of circles enclosing the fixed points is transformed into itself. In 
order to recognize the condition which yields this result, one must recall 
that any circle enclosing the fixed points is, according to well-known 
principles in analytic geometry, defined as the locus of a point for which 
the ratio of its distances from the two fixed points (or poles) remains 
constant. If z denotes a variable point on such a circle, 

[s — «| = constant [507] 
\z = Zq| 
The condition that the transformed point ¢ shall lie on the same circle 
evidently reads 
2 2 
&Z— &2 


[508] 


= _ 
§ — & 
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which, with regard to Eq. 506, means that 
-|Al = 1 (H # 1) [509] 


The parenthetic statement H ~ 1 is intended again to rule out the 
trivial case ¢ = z. This special type of the transformation is designated 
as elliptic. 

With regard to the further general properties of the linear fractional 
transformation given by Eqs. 488 or 506, it is useful to recognize that 
this transformation not only maps the points on a given circle in the 
z-plane upon a corresponding circle in the ¢-plane, but also furnishes a 
one-to-one correspondence between all points within those regions of the 
z- and ¢-planes bounded by these circles. In other words, the transforma- 
tion is said to map one of these regions upon the other. 

Now a circle may be considered to be the boundary either of the region 
enclosed by it, or of the complementary region which lies outside it. In 
order to remove this ambiguity, and also to furnish a clearer visualization 
of how the mapping of corresponding regions is effected by the trans- 
formation, one may consider a specific circle K in the z-plane and the 
corresponding circle C in the ¢-plane. The points 21, 22, 23 lie on the circle 
K, and the respectively numbered corresponding points £1, £2, {3 lie on 
the circle C. In the order in which they are numbered, the points in each 
set form a sequence which fixes a definite direction of traversal (clockwise 
or counterclockwise) around their respective circle. In this way, corre- 
sponding reference directions of traversal are fixed for the two circles K 
and C. It is clear from the preceding discussion that a variable point 
following along the circle K in its reference direction is transformed into 
a point which follows along the circle C in the corresponding reference 
direction. 

One may now imagine that at some point z the variable point following 
along K suddenly leaves this circular boundary by making a right-angle 
turn to the left. According to the principle of conformality, the corre- 
sponding variable point on C must likewise make a right-angle turn to the 
left at the point ¢ corresponding to z. If one imagines a second circle K’ 
in the z-plane lying wholly within the region to the left of K, with a diam- 
eter which differs from that of K by a small amount, this line of reasoning 
shows that the corresponding circle C’ in the ¢-plane lies wholly within 
the region to the left of C. Continuing this process of reasoning (by 
applying the same line of thought to the circles K’, C’ and a pair K’’, C’”” 
lying wholly within the regions to the left of K’ and C’, etc.) one finds it 
clear that the entire region to the left (or to the right) of K is mapped 
upon the region to the left (or to the right) of the circle C. 

_ If the reference directions of traversal around the circles K and C are 
the same (both clockwise or both counterclockwise), the region inside 
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(respectively outside) K is mapped upon the region inside (respectively 
outside) C, whereas if the reference directions are opposite, the region 
within K is mapped upon the region outside C, and vice versa. 

As an illustrative example, let it be required to find the function 
which will map the region inside the unit circle about the origin upon the 
region outside the unit circle. Since the unit circle forms the common 
boundary for the two regions in question, one must look for that linear 
fractional transformation which transforms the unit circle into itself. 
If the reference direction of traversal for the unit circle in the z-plane is 
assumed to be counterclockwise, whereas for the unit circle in the ¢-plane 
it is assumed to be clockwise, the regions to the left of these boundaries 
_ in the z- and {-planes are those which are to be mapped upon each other. 
With these points in mind, the normal form for the required transforma- 
tion is readily established. 

‘In Eqs. 505 and 506 one may choose the following correspondence of 
points: 
2 = +1 Z% = —1 %3 = ~j 


[510] 
Si&=4t1 f&=-1 f= ty 


This makes z = +1 correspond to the fixed points, Substituting into 
Eq. 505, one finds 


_j-ijis-j-i 

jt+i/ —-j+1 

so that the normal form of the required transformation, according to 
Eq. 506, becomes 


= —-1 [511] 


(Slee Sac | 
eo a [512] 


This is equivalent to 
1 
t= 2 _ [$13] 


which is the simple transformation discussed earlier in this article, and 
evidently yields the desired mapping relationship. By choosing different 
sets of corresponding points which nevertheless satisfy the condition of 
opposite reference directions around the unit circle, one may find in- 
numerable additional transformations which also map the inside of the 
unit circle upon the outside. It is, of course, not necessary that two of the 
chosen points be the fixed points of the transformation, since Eqs. 503 
and 504 apply to any arbitrary sets of corresponding points. 

As a second example, let it be required to find the substitution which 
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maps the inside of the unit circle in the z-plane upon the upper half of the 
¢-plane. According to the principles discussed above, the following 
corresponding sets of points may be chosen: 


a = +1 %=—l z= —7 
%&=tl &=-1 §=0 [514] 


The unit circle in the z-plane is traversed in the counterclockwise direc- 
tion from +1 around to —1 and through —7 back to +1; the real axis in 
the ¢-plane is traversed from +1 through the point » to —1 and thence 
through the origin back to +1. The regions to the left of these boundaries 
are those which are to be mapped upon each other. Substituting into 
Eq. 505, one finds 


—1 /-j-1 1-9 ‘ 
= - = —— = — 515 
fi <a fae! [S15] 
so that the normal form of the desired transformation becomes, according 
to Eq. 506, 


t-t1 2-1 


oti “ett _ 16] 
This is equivalent to 
eth ceed 
— L oor = jit 517 
oe egy See eg [517] 


Here the points z = -+1 again are the fixed points. 
Alternatively one may solve the present problem by assuming the 
corresponding sets of points 


4 = +1 f= ty zg —1 
Si =-1 tf =0 os = +1 [518] 


None of these are fixed points. Consequently the forms 503 and 504 must 
be used. According to Eq. 503, 


2/7 -2 
w= {/apopat ti B19] 


Substituting this into Eq. 504, one finds 


aos Pe Saat 2 

= >; 20OOr = 520 

ee: or rar [520] 

Comparison with Eq. 517 shows that these results are identical except 
that zis replaced by —z. 

From the nature of this problem it is obvious that Eqs. 517 or 520 still 
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constitute solutions if z is replaced by ze**, where ¢ is any chosen angle. 
For example, replacing z in Eq. 517 by —jz yields another possible 
solution, namely, 


1-2 ‘Seat 3 
= © ss” 521 
go care j+t [521] 
This has the corresponding points 
z=+i, +7 —1 —j 
I) ’ J [522] 
¢ = 0, + 1 3 oe 3 cae 1 
Again none of these are fixed points. The point z = —1 here corresponds 


to the point in the ¢-plane. In the transformation 517 the latter 
corresponds to the point z = 7; in Eq. 520 the point z = —7 corresponds 
to ¢ = «. Evidently the present problem may be solved with any 
desired point on the unit circle in the z-plane corresponding to the point 
o in the ¢-plane. 

Alternatively, one may for example map the right half of the ¢-plane 
upon the inside of the unit circle in the z-plane. The transformation 
which yields this result is obtained from any of the solutions to the pre- 
ceding example by replacing ¢ by 7. This change amounts to making the 
substitution ¢’ = —j¢ and subsequently writing ¢ for ¢’ again (a substi- 
tution which rotates all points by 90 degrees in the negative direction). 
Applied to the Eqs. 521, for example, this process yields 


1-2 Re ak % 
oes or $=T4e¢ [523] 
for which a set of corresponding points are 
z= +1, os —1, —j 
[524] 


g= 0, 7 wo, ty 


It is observed that these are the same points as those in the set 522 except 
that the {-points are rotated 90 degrees in the negative direction, so that 
the imaginary instead of the real axis becomes the boundary of the 
mapped region in the ¢-plane. 

* It is instructive to study the conformal maps of one of these functions 
in somewhat greater detail. The transformation 523 is particularly in- 
teresting in this respect, since it is perfectly symmetrical in the variables 
z and ¢. Figure 33 shows how a system of concentric circles about the 
origin in the z-plane, along with the orthogonal family of radial lines, 
are mapped in the ¢-plane. The origin in the z-plane becomes the point 
+1 in the ¢-plane. The concentric circles within the unit circle of the 
z-plane become eccentric circles about +1 in the ¢-plane, the unit circle 
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itself corresponding to the imaginary axis of the ¢-plane. The radial lines 
within the unit circle of the z-plane become those portions of the orthogo- 
nal circles through the points +1 which lie in the right half of the ¢-plane. 
The circles of the z-plane (shown dotted) which are larger than the unit 
circle become eccentric circles in the ¢-plane about the point —1, and 
the portions of the orthogonal circles through the points ¢ = +1 which 
lie in the left half of that plane (also shown dotted) correspond to those 
portions of the radial lines in the z-plane lying outside the unit circle. 


~w 


! 
~ 
* 


——- 


- 


(a) (b) 


Fic. 33. Map of the interior of the unit circle in the 2-plane to the right-half ¢-plane 
by the transformation z = (1 — ¢)/(1 +9). 


A more comprehensive view of this situation is gained through recog- 
nition that the points +1 and —1 of the ¢ plane are transformed into the 
origin (south pole) and the point « (north pole), respectively, of the 
complex sphere associated with the z-plane. In other words, the poles of 
the complex sphere associated with the z-plane are transformed into the 
finite points -+1 of the ¢-plane, and the resulting map in that plane 
represents the corresponding distortion suffered by the concentric circles . 
and radial lines in the z-plane which accompanies this shifting of the 
poles. This interpretation is responsible for the term “ bipolar circles ” or 
“ bipolar plot ” by which the map in the {-plane is also known. 

Since the forms 523 are symmetrical in the variables z and §, it follows 
that the sets of orthogonal loci in the 2- and ¢-planes of Fig. 33 may be 
interchanged. That is, if concentric circles about the origin together with 
the family of radial lines are drawn in the {-plane, the corresponding map 
in the zplane becomes that which in Fig. 33 is shown for the ¢-plane. 
In other words, the transformation 523 may alternatively be said to map 
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the inside of the unit circle in the ¢-plane upon the right half of the 
z-plane. 

A word of caution may be appropriate at this point in order to guard 
the reader against confusing part (b) of Fig. 33 with the similar appearing 
plot of Fig. 32. In the latter, the poles 2; and 22 are the fixed points of the 
transformation. Figure 33(b), on the other hand, is not the corresponding 
plot for the transformation 523, since the fixed points in this case are 
evidently not the points ¢ = +1. The maps in Fig. 33 are merely a pair 
of corresponding sets of curves chosen to illustrate the way in which the 
region enclosed by the unit circle is mapped upon the right half plane, 
and these curves have nothing in common with the type of plot shown in 
Fig. 32. The reader may further clarify his thoughts on this score by 
determining the fixed points of the transformation 523 and by subse- 
quently drawing loci of the type shown in Fig. 32. 

In connection with the preceding general discussion it is relevant to 
add the following remarks regarding several collaterally useful properties 
of the linear fractional transformation. It should readily be appreciated, 
for example, that the end result of carrying out two different linear 
fractional transformations in succession can also be had from a single 
one. More specifically, if the relation 


re as + by 
era? [525] 


represents a transformation from the variable z to a new variable 2’, 
and 
, 
me Az + be 


= a ee [526] 


represents a succeeding transformation to still another variable 2’’, 
it is always possible to relate z’’ to z directly by a transformation of the 
same form, namely, 


pp BAD 
ag [527] 
The truth of this statement follows immediately from the consideration 
that the linear fractional form carries circles over into circles and con- 
versely that any univalued analytic transformation of circles into circles 
is linear fractional (see footnote on page 375). Thus, two such trans- 
formations applied in tandem obviously accomplish no more than can be 
accomplished by a single one.* 


*Because of this property the totality of possible linear fractional transformations is said 
to form a group. 
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The coefficients a, }, c, d in the single resulting transformation 527 may 
be determined from the coefficients in the two separate transformations 
525 and 526, through substituting the relation 525 for z’ into Eq. 526 and 
putting the result into the form of Eq. 527. One finds that 


@ = Aga, + beocy 
b = doh; + body 
C = Cody + doc, 
d = Cob, + ded, 


It is interesting (as well as relevant in dealing with circuit problems 
involving a cascade of transmission networks) to observe that the rela- 
tions 528 may be expressed more compactly by means of the single 


matrix equation 
a'b|_|ae be ay | 
E i . [< a - bi dy [529] 


Another interesting fact is brought out when the linear fractionat 
transformation, Eq. 488, is put into the form 


[528] 


[530] 
a+ 2 


as is always possible so long as the conditions 489 and 490 are fulfilled. 
In view of this form, one may regard the arbitrary linear fractional 
transformation as equivalent to carrying out in succession the simpler 
transformations indicated by 


g=et ¢ [531] 
¢ 
gf! = sf [532] 
2 
6 ad 
c= ¢ - “) g’ + 7 [533] 


The first of these component transformations represents a displacement, 
the second is a simple inversion, and the third represents a magnification 
and rotation (multiplication by a complex constant) followed by a dis- 
placement. Since each of these component transformations carries circles 
over into circles, one arrives again at the conclusion that the linear 
fractional transformation has this property. 
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Finally, an interesting correlation with the process of stereographic 
projection may be mentioned. As pointed out in Art. 4, this geometrical 
process is one whereby every finite point in the complex z-plane has’ 
uniquely associated with it a point on the so-called complex sphere which 
is tangent at the origin. If a second plane ({-plane) is imagined to 
be tangent to the same sphere at some other point, and if the same 
geometrical process is used to associate points on the sphere with points 
in the second plane, it is evidently possible to state that the system con- 
sisting of the two planes and the sphere, together with the geometrical 
process of stereographic projection, enables one to associate uniquely a 
point z in one of these planes with a corresponding point ¢ in the other 
plane. 

It is of interest to show how the transformation implied by this process 
can be expressed analytically in terms of the linear fractional function 
488.* Although the converse of this statement is not generally true, it is 
worth noting that some of the linear fractional transformations en- 
countered in practical problems may be interpreted geometrically in this 
simple fashion. 

In order to develop the appropriate analytic relationships, one may 
begin by visualizing a sphere of diameter D and any two tangent planes 
(the z- and ¢-planes). The two points of tangency determine uniquely a 
great circle on the sphere. The plane containing this great circle is con- 
veniently chosen as a cross-sectional plane in which to indicate graphically 
further pertinent geometrical relationships. Figure 34 shows the system 
as viewed in this plane, to which the z- and ¢-planes are both orthogonal 
and hence appear as lines. The two polar axes, which are diameters of the 
sphere and are normal to the z- and f{-planes at their respective points 
of tangency, make an angle with each other which is denoted by +. 
The real and imaginary axes in the z- and ¢-planes have orientations which 
at the moment may be considered to be arbitrary. 

The points in the ¢-plane corresponding to z= 0 and z= o are 
denoted by {po and ¢., respectively, whereas those in the z-plane cor- 
responding to ¢ = 0 and ¢ = o are denoted by 2 and gz... These four 
points lie in the plane of the polar axes, and their locations in the z and 
¢-planes are shown in Fig. 34. From the geometry it is readily seen that 


|Z0| = tol a Dtan5 : [534] 


*The possibility of doing so should be evident from the geometrical proposition that the 
process of stereographic projection carries circles in the plane over into circles on the sphere, — 
and vice versa. Hence a circle in one of the planes is transformed into a circle in the other 
plane. Any univalued analytic transformation which carries circles over into circles. is expres- 
sible in terms of a linear fractional function. : 
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lg.| = [fol = D cot 5 [535] 
in which the absolute value signs are necessary because the lines repre- 
senting the z- and ¢-planes in Fig. 34 are not necessarily coincident with 
the real axes in these planes. 


Fic. 34. In many cases, a linear fractional transformation may be visualized as 
a twofold stereographic projection in terms of two planes tangent to the same complex 
sphere. 


With reference to the linear fractional function 488 and its inverse as 
given by Eq. 492, it is expedient in the subsequent considerations to 
observe that only three of the four constants a, b, c, d are independent, 
since the right-hand members of these relations have the same value if 
their numerators and denominators are multiplied by a common (finite, 
nonzero) factor. Inasmuch as it is assumed that c # 0, one may choose the 
constant 1/c as this common factor, or, what amounts to the same thing, 
arbitrarily set c equal to unity. Equations 488 and 492 then yield the 
results 


and 


So = a [537] : 
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Comparison with Eqs. 534 and 535 shows that 
la| = |d] = D cot 5 [538] 


and 
lo] = D? [539] 


Referring to Fig. 34, one recognizes that the angles of the complex 
quantities z and z. (in the complex z-plane) must differ by w radians 
and that the same is true for the quantities [9 and ¢.. In view of Eqs. 
536 and 537, this fact yields the conclusion that 


o must be a negative real number [540] 
This result, together with the condition that \a| = |d|, represent the 
restrictions which must be imposed upon a given linear fractional function 
in order that it may possess the simple geometrical interpretation dis- 
cussed here. For any given linear fractional function fulfilling these con- 
ditions, the diameter of the sphere and the angle y between the polar 
axes are given by Eqs. 538 and 539, whereas the orientations of the real 
and imaginary axes in the z and {-planes are obtained from Egs. 536 and 
537. Thus the corresponding geometrical configuration is determined. 

Conversely, if the geometrical configuration for the sphere and the two 
tangent planes is given, and one wishes to find the corresponding linear 
fractional function, three of the relations contained in Eqs. 536 and 537 
may be used to determine the complex constants @, b, and d. The magni- 
tudes of these constants must, of course, agree with the values given by 
Eqs. 538 and 539. This determination is always possible. 

As an illustration it is interesting to consider the transformation 523 
by means of which the interior of the unit circle in either plane (z or ¢) is 
mapped upon the right half of the other plane. By inspection of Eqs. 523 
one sees that 


a=-—ti 6=1 ad=1 [541] 
and also that 
fo = 1 tf. =-1 


These values are consistent with Eqs. 536 and 537 as, of course, they 
should be. From Eqs. 538 and 539 one obtains 


[542] 


Del 41=5 [543] 


378 FUNCTIONS OF A COMPLEX VARIABLE [Ch. VI 


The transformation in question is, therefore, given geometrically 
(according to the present interpretation) by means of a sphere of unit 
diameter and a pair of tangent planes which are normal to each other as 
well as normal to the plane containing the polar axes. This configuration 
is shown in Fig. 35. Although the values 542 uniquely determine the 
relative orientations and positive directions for. the real axes in the z- and 
g-planes, the positive directions for the imaginary axes are not uniquely 
fixed, because all these values are real. If the imaginary axis for the 
z-plane is assumed to point vertically 
upward from the plane of the paper in 
Fig. 35, and the unit circle in the 
z-plane is traversed in the counter- 
clockwise direction so as to keep the 
interior on the left, then, since the im- 
aginary axis in the ¢-plane must be 
traversed in such a direction as to 
keep the right half plane on the left, 
Fic. 35. Visualization of the trans- 07° observes that the nagar. aus 
formation zg = (1—)/(1+¢) asa for the {-plane points vertically into 

twofold stereographic projection. the plane of the paper in Fig. 35. 

This choice for the positive directions 

of the imaginary axes places the top faces of the z- and ¢-planes op- 

posite the sphere. It is alternatively possible to assume both positive 

directions for the imaginary axes reversed, whence the top faces of the 
planes appear adjacent to the sphere. 

To anyone who has developed a facility for visualizing geometrical 
objects in three dimensions, this interpretation for the transformation 523 
affords a useful means for correlating its various detailed characteristics. 


25. A MORE GENERAL MAPPING FUNCTION; THE SCHWARZ- 
CHRISTOFFEL FORMULA 


The principal field of usefulness with regard to conformal mapping is 
found in connection with problems in potential theory. When the geome- 
try of the physical system exhibits longitudinal uniformity in one of its 
dimensions, the problem reduces to a two-dimensional one. As shown in 
Art. 5, the real and imaginary parts ~ and 9 of a function of a complex 
variable satisfy Laplace’s equation in two dimensions and hence represent 
potential functions. Since the loci for « = constant and » = constant 
form orthogonal families, they may be regarded as the equipotential and 
flow lines of a nonturbulent field, that is, of a pure potential field. If a 
pair of such functions can be found whose loci conform to the geometrical 
boundaries of a given physical system, the solution to the boundary 
value problem for that system is thereby given. 
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For example, Poisson’s integral, Eq. 323 or Eq. 324, yields the potential 
inside a circle in terms of specified boundary values. This integral, there- 
fore, constitutes a solution to the boundary value problem for the circle, 
the specific forms given by Eqs. 332 and 333 expressing this solution as a 
sum of exponential functions. 

By means of a function which conformally maps the interior of a circle 
upon a region having a different geometrical configuration, the boundary 
value problem for that configuration may be transformed into an equiva- 
lent one for the circle, and the desired solution is found through applying 
to the solution for the circle, the inverse transformation. 

The boundary is commonly an equipotential locus, so that the specifica- 
tion of boundary values amounts merely to stipulating that the potential 
function shall be constant on the boundary. In electrical problems, this 
boundary is the surface of a conductor or of a material having a large 
permeability relative to that of the surrounding medium. The solution 
to the boundary value problem for the circle then consists simply of 
concentric circles for the equipotential loci, with the orthogonal radial 
lines representing the flow field. The origin, or center of the circle, and 
the point at infinity represent the source and sink for this field. 

An even simpler geometrical configuration is given by a pair of bounda- 
ries in the form of parallel lines. When these are constant potential loci, 
the flow map consists of a rectangular grid of flow lines and equipotential 
lines. Such a rectangular region is mapped upon the interior of the circle 
by means of the function 


z=e or t=Inz [544] 
Writing | 
z=rei> and t=m+jn [545] 
one finds from Eq. 544 that 
m=Inr and n=¢ [546] 


If the origin (r = 0) and infinity (r = ) in the z-plane represent the 
source and sink for the field, the radial flow enclosed by a pair of lines 
= ¢1 and ¢ = ¢2 (these may be ¢ = 0 and ¢ = 27) is mapped upon 
the -plane as a parallel flow enclosed by the straight lines » = ¢, and 
n =‘. The equipotential loci, which in the z-plane are the circles 
r = constant, become the straight lines m = constant, which are at right 
angles to the flow lines n = constant. Source and sink become the points 
m = +. These matters are illustrated in Fig. 36. The rectangular flow 
map in the -plane may be regarded as a reference field (a flow map of the 
simplest type) to which the radial type of flow map is transformed by the 
function 544. 
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Inasmuch as the upper half of the z-plane may be conformally mapped 
upon the upper half of the ¢-plane, according to the discussion of the 
previous article, with any two desired points* on the real axis of one 
half plane corresponding to the origin and infinity in the other, it follows 
that a flow map in the upper half plane, with any two points on the real 
axis designated as the source and sink, may be reduced to the reference 
field in the ¢-plane of Fig. 36. If transformations can be found which map 
regions having other geometrical configurations for their boundaries, 
upon the upper half plane, or upon the region enclosed by the unit circle, 
a way is established for also reducing the flow maps for these configura- 
tions to a simple rectangular reference field, thus making possible the 
solution to boundary value problems in these more complicated cases. 


Fic. 36. Transformation of a radial flow map to a simpler flow pattern by the 
transformation z = e¢’, 


An extremely useful mapping function, of considerable generality in 
its ability to meet various geometrical configurations, is given by the 
so-called Schwarz-Christoffel formula, which reads 


w(z)= uf (§ — a) — 22) (Ee) de +N [547] 


Here ¢ is a running variable in the z-plane, 21, go, + » » %, are ” finite points 
on the real axis, numbered in such an order that 


By < 22 < +++ < kn [548] 


and the quantities 4;, u2, -- + un appearing in the exponents are any set of 
positive or negative real numbers. The constants M and N may have 
complex values, with the possibility that N be zero, but M must, of 
course, have a nonzero value. The lower limit 2 of the integral is an 
arbitrary point in the upper half plane. It may be chosen equal to zero, 
or equal to one of the points 2; - - - z,. The principle guiding this choice is 
best seen from the illustrative examples given subsequently. 


*A third point (for example, midway between the other two) may be chosen to correspond 
to the point +1 (which lies between the origin and infinity). 


Art. 25] THE SCHWARZ-CHRISTOFFEL FORMULA 3ST 


The independent variable for the mapping function w(z) is the upper 
limit of the integral. For this reason the derivative of the function is 
given by 


d: 
= = M(z — %)-(@ — 2) ++ (@ — Bay [549] 
as may be seen from the fact that if one has 
w(z) = f f(s) d& [550] 
2 


the usual definition for the derivative 


dw. ., [w(z + Az) — w(z) 
es [ Az ] [S51] 
yields 
1 24 As 
w(e+ Az) —w(e) = ffs) ar [552] 


Since Az is a small displacement (becoming zero in the limit), one may 
say that for the integration in Eq. 552 the function f(¢) is essentially 
constant and equal to the value f(z). It is assumed, of course, that the 
function f(¢) is continuous in the vicinity of the point ¢ = 2, which is a 
recognized condition for the existence of the derivative in the first place. 
With f(¢) equal to the constant value f(z), it may be placed in front of the - 
integral sign, and Eq. 552 yields 


: e+ As 
w(e + ds) = w(e) fe) [de = fe) az [583] 


the approximation becoming exact in the limit Az — 0. Completing the 
limit, one finds, therefore, that 


dw 
7 71@) [554] 


The essential character of the function w(z) may now be recognized 
from a study of the behavior of the derivative 549 in the vicinity of the 
points z = z,. The first step in this direction is to represent the factor 
(g ~ z,) in the polar form as illustrated in Fig. 37. This representation 
reads 

(2 — %) = |s — ae tst?7®) [555] 


in which & is an integer. 
Then 
(3 — By) = |z — a [ re Tererterkny [556] 
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Since the quantity yu, is not necessarily an integer, the right-hand side of 
Eq. 556 may have many different values for different integer values of k. 
In order to remove this multivaluedness of 
the factor (z — z,)~™, it is specified at the 
outset that & shall assume only the value 
zero. This specification is equivalent to 
stating that the function dw/dz is to be 
studied on only one of the many leaves of 
its Riemann surface, namely, on that one 
which corresponds to k = 0 in Eq. 556. A 
typical factor in Eq. 549 then becomes 


Fic. 37. Representation of 
%— 2, in polar form in the 
study of dw/dz. (2 — 2) = | z- %, |~Bre Furby [557] 


and if the point z is allowed to lie only in the upper half plane or on the 
real axis of the z-plane, it is clear from Fig. 37 that 


05% S57 [558] 
When the polar forms 
M = |M|e* [559] 
and 
dw |dw 4 
dz a] ° [560] 


are introduced, it follows that ; 
@ = a — pid — Bode — + ** — Undbn [561] 


It is now assumed that the variable z in the function 549 is restricted to 
real values only; that is, the variable point z is thought of as moving along 


z-plane 


Fic. 38. The path along which dw/dz is studied in the Schwarz-Christoffel trans- 
formation. ~~ 


the real axis from — «© to », the only deviation from this behavior occur- 
ring wherever the variable point z encounters one of the critical points 
2». There it makes a slight detour around the critical point instead of pass- 
ing directly through it. These detours may be visualized as having the 
form of vanishingly small semicircular arcs lying in the upper half plane, 
as shown in Fig. 38. As the point z traverses a small semicircular arc in 
the vicinity of the point z,, the angle ¢, changes from the value 7 to zero, 
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whereas the angles of the remaining factors do not change at all because 
of the assumed vanishingly small radius of the semicircular detour. 
Hence for the range 

Z1 <2 < Say [562] 


¢o1=$3= ee =,1 =0 rZdy 20 drt = Srp2= soe =h,=7 [563] 
and* according to Eq. 561 
a— (My tora + aie +un)T <6 Sa— (u4i1+emret’ : tun) [564] 


Throughout the range 562, the angle @ is, therefore, increased by the 
amount 
AO = pyr [565] 


the important feature being that this increment occurs only as the point z 
traverses the small semicircular arc. In other words, as the point z moves 
along the real axis, the angle @ remains constant as 2 proceeds from one 
of the critical points to the next, receiving a sudden increment A@ = wz 
only as z passes directly over the critical point z,. 

According to the discussion of conformal mapping in Art. 2, it is recog- 
nized that the map of the function w(z) in the w-plane, correspond- 
ing to the real axis in the z-plane, 
consists of a succession of straight- 
line segments between the points 1, 
We, +++ corresponding respectively to 
21, Z2,"°*, the angular directions of two 
consecutive segments confluent in the 
point w, differing by y,7. That is, the 
map in the w-plane of the function 
547, corresponding to the real axis in 
the z-plane, traversed from — © to ~, 
has the general character shown in 
Fig. 39. This result follows from the 46 39. The map in the w-plane 
fact, pointed out in Art. 2, that the of the real axis in the z-plane shown 
angle of dw/dz equals the difference be- in Fig. 38. 
tween the angles of the increments dw 
and dz, and since the angle of the latter remains zero as the point z 
moves along the real axis, the angle of dw/dz must equal that of dw. 
This angle, however, is shown to remain constant except when 2 passes 
over one of the critical quantities 2, At the corresponding points w, 
then, the direction of the increment dw suddenly changes by the 
amount pT. 


*If u, is negative, the inequalities in Eq. 564 are reversed. 
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The plot in the w-plane corresponding to the real axis in the z-plane 
is thus seen to be a polygon with the points w - ++ w, as its vertexes. If 


wy toe tess + in = 2 [566] 


the sum of the increments AO at the vertexes w, --- w, equals 27. 
Since 2m equals the sum of the external angles of a closed polygon, it 
follows that when the condition 566 is fulfilled, the polygon is m-sided. 
Unless one or more of the values of 41 - - + un are equal to or greater than 
unity, all the » vertexes of the polygon lie at finite points corresponding 
to the finite values z, --- 2, on the real axis of the z-plane. The point at 
infinity in the z-plane corresponds to an ordinary point on the straight- 
line segment joining the vertexes w, and w. If a factor yw, has a positive 
value equal to or greater than unity, the corresponding vertex of the poly- 
gon in the w-plane lies at infinity. This circumstance is subsequently dis- 
cussed in detail. 

When the condition 566 is not fulfilled, the point at infinity in the 
z-plane also corresponds to a vertex, and the polygon in the wplane has 
‘# + 1 sides. In order to consider this possibility more fully it is neces- 
sary first to study the function w(z) in the vicinity of the point z = «. 
For this purpose it is convenient to make the substitution 


—1 -1 
* — j = 
= a with 3*, = [567] 
Then 
(fF — 2) = (Fat). (CF [568] 
and 
ag* 
df =—yG 
g (AP [569] 
so that Eq. 547 takes the form 
_ vad (2) hee (¥en (ge) ua toe ts t+un—2) 
wit) = Me Po TN [870] 
in which 
—1 
eee 
Oe [571] 


The derivative reads 
dw M(z*,)" +> (a*, An (gh ur tuat ee +nn—2) 


aC I aRea EEE CE 2) 


The point z = © corresponds to z* = 0. For the vicinity of 2* = 0, 
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Eq. 572 gives 
a ~ (- 1)"M (gt) teats t+un—2) [573] 


Hence it is clear that the point z = © yields a vertex at which the angular 
increment Aé@ has the value 


AO= (2 — m1 — we — °° — Bn) [574] 
This vertex also lies at a finite point in the w-plane unless the quantity 
(2 — uw — ua — +++ — um) is equal to or larger than unity. It is obvious 


that the vertex does not exist if the condition 566 is fulfilled. 

In order to study the behavior of the function w in the vicinity of one 
of its vertexes, one may first write for the derivative 549 

dw 

We: = (- Zr) “#Tdg + a1(z — Sr) + do(Z — %)? +--+] [575] 
in which the bracket expression is a Taylor expansion for the function 
549 with the factor (z — z,)~#” missing. For this Taylor expansion, which 
is evidently possible because the function it represents is regular at the 
point z = 2,, the coefficient dp is certainly not zero. Hence assuming 
pw» < 1 the term by term integration of Eq. 575 yields 


. _ ME ~ He) — 2, a(g — 2) et? 
= dw fo as 
eas Lf = Seg ee 


For the immediate vicinity of the vertex at the point w = w,, therefore, 
one has the representation 


+-++++w, [576] 


ao(Z — 2, —prtl 
ww, + goles SSE [577] 
1— My 
This analysis shows that so long as », < 1, the function w(z) is regular 
in the vicinity of the vertex w,. This vertex may have a variety of ap- 


Y Yy 
w ~~ 
~ UY yt My 
O< pwy<1 ~1<uy<0 My=-1 ~Qeuyp<~l 
(a) (b) © @) 
Fic. 40. Appearance of vertexes of polygon in the w-plane with varying values 
of . 


pearances depending upon the particular value of u, < 1. Some of these 
are shown in Fig. 40. The arrows on the line segments indicate the direc- 


386 FUNCTIONS OF A ‘COMPLEX VARIABLE [Ch. VI 


tion in which the contour of the polygon is traversed as the point z 
travels in the positive direction (from — to + oo) along the real axis 
of the z-plane. The region enclosed by the polygon, which is that on the 
left of the contour, is shown shaded. 

Part (a) of the figure shows the appearance of the vertex when 
0 <n, <1. Here the external angle has a positive value between zero 
and x. Part (b), for which the external angle has a negative value be- 
tween zero and —z, corresponds to —1 <u, < 0. Part (c) shows the 
vertex for uy, = —1. Here the external angle equals —7. Finally, part (d) 
illustrates the appearance of a vertex at which the external angle is less 
than —-7 (that is, equal to the negative of a quantity which is larger 
than 7). In this case the enclosed region involves a double-mapped 


portion. 
Tf uw, = 1, Eq. 575 yields 
d 
= do(z — 2) + ay + ao(s — 2%) + °° [578] 


and the integration then gives 
w= [aw = dgln (8 — &) + 41(% — %) +F@— a) + +C [579]: 


in which 
C = —ao In (2% — &) — 1(% — %) — °° [580] 


In this case the vertex w, evidently lies at infinity, since the value of the 
right-hand side in Eq. 579 becomes infinite for z = z,. Figure 41 shows 
how this vertex may be imagined to appear, if 
My™. it is assumed that one is permitted to indicate 
cao ol infinity as a finite point. The enclosed region in 
wy, this vicinity is null, since it is contained be- 
My=1 tween line segments which fall upon each 
Fic. 41, Appearance of Other. In the finite region of the w-plane the 
the vertex when», = 1. appearance of a polygon with a vertex w, of 
this sort at infinity is shown in Fig. 42. The 
vertex of Fig. 41 at infinity is seen to result from a pair of confluent line 
segments which are parallel and hence meet or intersect at infinity. 
When yw, has any value larger than unity, the corresponding vertex 
also lies at infinity. Figure 43 shows the appearance of a polygon with 
such a vertex. The external angle lies between + and 27, and yw, has a 
value between 1 and 2. For yw, = 2 the external angle equals 27. Analysis 
similar to the preceding then shows that the function w(z) has a simple 
pole at the point z = z, in addition to having a logarithmic infinity there. 
In the light of the preceding discussion and interpretation it may now 
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be stated by way of a summary that the function defined by the integral 
547 uniquely and continuously maps all points on the real axis of the z- 
plane upon the boundary of a polygon in the w-plane. Since its derivative 
as expressed by Eqs. 549 and 572 is regular at all points in the upper half 
of the z-plane inclusive of the point at infinity, it follows that this func- 
tion is regular and continuous throughout this entire half plane and is 
single-valued by virtue of the stipulation regarding the choice of values 
for the factors (¢ — z,)~"". It may be concluded, therefore, that the 
function also maps the entire upper half of the z-plane upon the region en- 


ye 


w-plane 


w-plane 


Fic. 42. Appearance of the polygon Fic. 43. Appearance of the polygon in 
in the finite w-plane when one of its the finite w-plane when one of its 
vertexes corresponds to un, = 1. vertexes corresponds to pn, > 1. 


closed by the polygon in the w-plane, for any closed boundary lying wholly 
inside the polygon must by reason of the continuity and single valuedness 
of the function also lie wholly within the upper half of the z-plane. The 
availability of this mapping function and that represented by the linear 
fractional form discussed in the preceding article greatly enhances the 
means for solving two-dimensional boundary value problems, as is illus- 
trated by the following examples. 

The first problem to be discussed is the determination of the field dis- 
tribution in the vicinity of the edges of a parallel-plate condenser. Since 
the distortion of the field (“fringing”) is confined to the more im- 
mediate vicinity of the edge, the plates may be assumed to be infinitely 
wide, and since the field distribution is symmetrical about a center line 
between the plates, it is sufficient to map the field on one side of this center. 
line only. The region over which a field map is desired may, therefore, 
be sketched as shown in Fig. 44. The edge of the condenser plate is at the 
point C above the origin of the w-plane. The plate itself is parallel to the 
real axis of this plane and extends infinitely far to the left. The real axis 
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represents the center line between the plates, so that the distance d 

equals half the spacing between them. 
The shaded area, throughout which the field map extends, is regarded 
as the region enclosed by a polygon which has three vertexes. One of 
these lies at the finite point C, and 


Za _ Wy the other two lie at infinity. One of 
. these latter two results from the 

4, — region A with its parallel boundaries 
_ extending to infinity on'the left. This 


one, which may be designated as 

“ vertex A,” evidently has the char- 

Fic. 44. Relevant to determining the acter of the vertex at infinity for the 

field of a parallel-plate condenser. polygon of Fig. 42. The other vertex 

at infinity, which may be designated 

as “ vertex B,”’ has the character of the vertex w, in Fig. 43 for yu, = 2. 

It represents the infinite extension of the region B to the right and upper 
left of Fig. 44. 

The electric flux being considered as the fluid, the flow in the parallel- 
plate condenser is from one of its plates to the other, whereas the equipo- 
tential loci are. the orthogonal curves symmetrically grouped about the 
center line. However, from a mathematical standpoint it is equally 
admissible to consider the flow in the space between the plates to be in 
the general direction along the center line, with the equipotential loci 
extending from one plate to the other. In other words, the flow map 
consisting of orthogonal families of loci depends only upon the geometrical 
configuration of the boundaries, and hence it makes no difference which 
of the families of curves is thought of as representing the flow of a physical 
fluid. In the present instance it is convenient to think of the fluid as 
flowing sideways, with the plates as longitudinal boundaries and the 
vicinity of the edges as the throat from which this fluid issues into the 
entire surrounding space. According to this point of view, the vertex A 
becomes the source and the vertex B becomes the sink. 

If the region enclosed by the polygon of Fig. 44 is now mapped upon 
the upper half of a z-plane in such a way that the origin and the point 
at infinity for this plane are identified respectively with the vertexes 
A and B of the polygon, the equivalent flow map in the z-plane is simply 
given by the concentric circles about the origin (equipotential loci) and 
the radial lines extending from the origin to infinity (flow lines). The 
vertex at the point C in Fig. 44 is then represented by a finite point on the 
real axis of the z-plane. 

In view of the fact that the points on the real axis of the z-plane cor- 
responding to the vertexes of the polygon must be arranged in such an 
order that they are encountered in the same sequence during a traversal 
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of the real axis in the positive direction (leaving the upper half plane to 
the left) as are the vertexes of the polygon when its boundary is traversed 
in the corresponding positive direction (leaving the enclosed region to 
the left), it follows that the point on the real axis of the z-plane cor- 
responding to the vertex C must lie to the left of the origin as indicated 
in Fig. 45. This is the point designated as 


z= 2,. Since this vertex is like the one pe 
shown in part (c) of Fig. 40, the corres- 

ponding exponent y; has the value —1. y Wp Wy 
The vertex A, which corresponds to z = 0, Y/ IG Y 
is like the one shown in Fig. 41. The c A 


p-value for this vertex, therefore, is +1. fie. 45. Contourin thez-plane 
As pointed out in the preceding discus- for the problem of Fig. 44. 
sion, a vertex which corresponds to the 

point z= © is not represented by a factor of the form (¢ — &) 
in the integral 547 but comes about by virtue of the behavior of this 
function in the vicinity of the point at infinity. In other words, the cor- 
responding factor is implicitly rather than explicitly contained in the 
general form of the integral, which for the present problem is now recog- 
nized to read 


z dt 
w=M | ——_a TX 581 
ln (§ — 2) 1(¢ — 0)? [581] 
Thus for the vertex B (corresponding to z = ~) one has, according to 
Eq. 574, 

Ag = (2+1-—1)" = 20 [582] 


which checks with the above statement that this vertex has the character 
of w, in Fig. 43 for uw, = 2. 
Equation 581 may more appropriately be written in the form 


wou [Ewe ry [583] 


Tt is now convenient to choose the lower limit zo equal to 21, corresponding 
to the vertex C in Fig. 44. Here w must have the value jd, and since the 
integral in Eq. 583 vanishes for z = 2 = 21, it follows that 


N = jd [584] 


The constant M may be evaluated through calculating the increment — 
in the function w corresponding to the increment in z resulting from a 
traversal of the semicircular detour about the point z = 0. For this 
semicircular path one may write 

5 = pel? [585] 
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in which p is the radius of the small arc. For small values of ¢, the factor 
(¢ — 2:) in the integral of Eq. 583 may be replaced by —21, and inasmuch 
as Eq. 585 yields 


of = j ds [586] 


it is seen that the increment in the function w corresponding to the arc 
increment in s becomes 


Aw = —2,Mj f dp = jrzM [587] 


This increment in w must equal the change in the value of w which cor- 
responds to passing around the vertex A. With reference to Fig. 44 one 
sees that this change equals the increment in w between the bottom face 
of the condenser plate and the center line, and hence that this increment 
in w must equal —jd. Thus 


Aw = —jd = jrz,M [588] 
whence 
ee [589] 
7 WS, 


- The representation for the function w now reads 
_@ (#2) a 


WS, 21 £ 


+ jd [590] 
The integration, which is straightforward, yields 


w= {1-2 4ins-tna| +d [591] 
T 41 

Since there are no further conditions to be satisfied, it appears that the 
location of the point z; on the negative real axis in Fig. 45 may be chosen 
arbitrarily. A choice which yields a simple form for the resulting mapping 
function is z, = —1. Since In (~1) = jx, Eq. 591 becomes 


w= te+ing [592] 


This is the desired mapping function which converts the family of 
concentric circles about the origin of the z-plane and the orthogonal 
family of straight lines radiating from the origin (both confined to the 
upper half plane) into the orthogonal families of loci which map the field 
and the equipotential lines for the condenser plate in Fig. 44. If desired, 
this map may be converted to the rectangular reference form shown in the 
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i-plane of Fig. 46 through making the further change of variable given 
by Eq. 544. Then Eq. 592 becomes 
d 
w= me +t+e'] [593] 
The reference field in the plane, with several points of particular 
interest marked on it, is shown in Fig. 46. The edge C of the condenser 
plate in Fig. 44 where w = jd corresponds to t = jx. The under side of 
the condenser plate in Fig. 44 corresponds to the portion of the horizontal 
line ” = jx to the left of C; the 
top side of the plate corresponds 
to the portion of this line to the 
right of C. The center line in Fig. 


t-plane 


under side of plate top side of plate 


44 is given by the real axis of the 
t-plane. The transformation 593 is 
seen virtually to remove the 360- 
degree bend in the boundary at 
the edge C in Fig. 44 so that the 


ft try 
[w=O[ [| center line [TTT] 
ann 
t=~1.279 0 


Fic. 46. The field of the parallel-plate 


: condenser in the /-plane. 
top side of the condenser plate 


appears as a linear extension of the bottom side. It may also be of in- 
terest to note that the origin of Fig. 44 does not appear directly below 
the point C in the +plane but is shifted to the negative real point 


— os vg 


Fic. 48. The contour in the 
z-plane for the problem of 
Fig. 47. 


Fic. 47. Relevant to plotting the 
flow in a right-angle bend. 


i = —1.279. The process of transferring the rectangular grid of lines in 
the é-plane of Fig. 46 to the region of interest in Fig. 44 by means of the 
function 593 is left as an exercise for the reader. . 

As a second example let it be required to determine the field map for 
the flow around the right-angle bend indicated by the shaded region in the 
w-plane of Fig. 47. The polygon in this example has four vertexes. One 
lies at the origin O, another at the finite point P, and the remaining two 
at infinity. Both these have the character of the vertex w, of Fig. 42. 
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They are here denoted as the vertexes A and B. They become the source 
and sink respectively for the present problem. 

The points on the real axis of the zplane which are chosen to cor- 
respond to these vertexes are indicated in Fig. 48. The source vertex A 
is assumed to lie at the origin and the sink vertex B at infinity. The 
proper order, as determined from the corresponding directions of traversal, 
requires that the vertexes O and P lie respectively to the left and to the 
right of the origin. These are designated as the points z = 2, and 2 = 2e. 

Recognizing that the exponents y, for the vertexes O, A, P are respec- 
tively +4, +1, and —4, and choosing the lower limit 29 of the integral 
547 equal to z;, one has 


ie : dt 

w= M ei '¢¢ —_ a)'/4(¢ _ o)'(¢ an Bg) 1/2 + N [594] 
or : 
_ 1/2 


ln £(¢ — 2)? 


Since the origin of the w-plane corresponds to the point 2 = 21, one has 
w(z,) = 0, and hence Eq. 595 yields 


N=0 [596] 


The reason for choosing the lower limit of the integral as the point z = 2; 
‘is thus evident. 

In order to study the function in the vicinity of the vertex B which 
occurs for z = ©, it is necessary to make the change of variable 


—1 
=> [597] 
whence 
dt d¢* 
= [598] 


The vicinity ¢ = © is identified with the vicinity ¢* = 0. The increment 
in the function w resulting from passing around the vertex B at ¢* = 0 
is determined through considering the integral 595 for ¢ —> «© and making 
the substitutions 597 and 598. ‘This gives 

p d¢* 
=5 * 
in which the integration is to be extended over a small semicircular arc 
of radius p about the origin of the ¢*-plane. For this arc 


s* = pei? [600] 


Aw = —M [599] 
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and 
ae* 
ot SP 
Iu [601] 
so that the integral 599 becomes 
aw = —jM f° do = jxM [602] 


But “ passing around ” the vertex B in the w-plane amounts to going 
from the right- to the left-hand boundary of the cross-hatched region B 
in Fig. 47 and hence yields an increment in the function w equal to ~d,. 
Therefore , 


Aw = —d, = jrM [603] 


In a similar manner, passing around the vertex A corresponding to 
the vicinity of ¢ = 0 yields 


Z [Pd ag . Zo 7 Zo 
dw = M ea M ae f dé = —jxrM J 
- aa —e ¢ is Zn : ve 21 [604] 
But according to Fig. 47 this must equal ida, Hence 
jdz = —jxM .J— [605] 


2 


Equations 603 and 605 determine the constant M and the ratio 22/2). 
Since there are no further conditions to be fulfilled, one of the points 
Z, OF Z2 may be chosen arbitrarily. For the moment this choice is left 
open. The ratio is found to be given by 


S77 Or = a5 [606] 


This result agrees with the fact that 22 is positive and z, negative. 
The required function is now represented by 


at pad deat 


dg [607] 


a OVE — ay 
which may be converted into 
Ih $= 2 
Treg eee 608 
eer eer (¢ — 21) (¢ — 22) [608] 


or 


jn Jdi22 dt 
= ff ————_ ~ —————— [609 
x I V (5 — a1) (5 — 22) v J wv — a) (¢ — 22) [609] 
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If in the second integral one makes the change of variable indicated by 
Eq. 597 and also the consistent changes 


1 
a= ry a* 19 bara [610] 


with the help of Eq. 606, one finds 


ja, f°” dg dy (* ay* | | 
® i Vi(g—aj(e—%) 7 i Vg = 21) (C= oy) a 


The integration yields 
jay fe — 4a, +22) +V (2 — a) (2 — “2i} 
w=-—In 1 
T 3 (1 — 22) 
x _ lak * (eI CoE 
2 AC atigass 2 ican Cs 31) (2 aa [612] 
3 (2"1 z 2) 


Inasmuch as an arbitrary specification with regard to 2, or Ze still 
remains open, it is possible, in agreement with Eq. 606, to set 


+2 1n| 
vie 


me kee 
mT and 4 = a [613] 
Then Eq. 612 takes the form 
id d 
w= dy +72 fe) +f") [614] 
in which 
2_ 72 7 AN. TN 
5+ aE (n+ 2s -%) 
an 2dd2 dy dy [61s 
fe _ d,? + dy? ] 
2dde 
For purposes of calculation it may be helpful to note that the sub- 
stitution 
d. 
go: AF = *) 
fie SZ 616 
aaa Cre ve 
2 dy ds 
converts the expression 615 to the form 
f2) =n’ +V@’P— 1) [617] 


If d, = dg, the resulting mapping function is greatly simplified inasmuch 
as Eq. 616 shows that then 2’ = z. 
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As in the previous example, the field map may be reduced to a rec- 
tangular grid by means of the additional substitution z = e’. 


26. HURWITZ POLYNOMIALS; STABILITY CRITERIA 


In problems dealing with the dynamics of a physical system one is 
frequently concerned with the question of the stability of its behavior. 
In terms of the so-called characteristic equation of the system, which 
has the form 


On” + dp13" 1 +--+ + a2 + a = 0 [618] 


with real coefficients ag - - - dn, this question is answered in the affirmative 
if it can be established that all the real roots and all the real parts of the 
complex roots are negative. Stated in another way, the stability of the 
physical system is assured if it can be shown that all the zeros of the 
polynomial ; 

P(s) = Gy2" + Gn12" | + +++ + a2 + a [619] 


lie in the left half of the z-plane. A polynomial having this property is 
called a Hurwitz polynomial. 

The necessary and sufficient conditions which the coefficients of an 
arbitrary polynomial (with real coefficients) must satisfy in order that 
it be a Hurwitz polynomial are spoken of as the Hurwitiz criteria. In 
dynamics these same conditions are alternatively referred to as Routh’s 
stability criteria. 

Starting with the factored form of the polynomial 619, which reads 


P(@) = an(@ — %1)(@ — &2) ++ (@ — Zn) [620] 


one may readily establish (by multiplying out and collecting terms with 
like powers of z) that 


Cook = (Gy + te t+ + Bn) 

an 

= = 24% ~ S183 es Tt Sn_130 [621] 
% 
= (—1)" + 2122 °- ‘Ln 


Tf it is assumed that all the roots are real and negative, and that a, > 0, 
it is evident that all the coefficients are positive. If some or all of the roots 
are in the form of conjugate complex pairs, then one may likewise estab- 
lish (by the use of Eqs. 621) that all the coefficients are positive if the 
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roots have negative real parts. It is thus seen that the positiveness of the 
coefficients* is a necessary condition for the negativeness of the real parts 
of the roots of a given algebraic equation, but this condition alone proves 
to be insufficient, as is shown by the following discussion. 

One begins by investigating the significant properties of Hurwitz 
polynomials. In these considerations two simple facts form the basis 
from which the pertinent conclusions are readily obtained. These facts 
are, first, that the zeros of the polynomial P(z) lie in the left half of the 
z-plane and, second, that their distribution in this half plane is sym- 
metrical about the real axis. The last statement is evident from the fact 
that complex roots must occur in conjugate pairs. 


Fic. 49. Relevant to the properties of a third degree Hurwitz polynomial. 


The polynomial is considered to be in its factored form as given by 
Eq. 620. For an arbitrary value of the complex variable z, the several 
factors for a polynomial of the third degree may be represented graphically 
as shown in Fig. 49. Parts (a) and (b) of this figure show how the graphi- 
cal representation of the factors changes when the algebraic sign of the 
variable z is reversed. It is clear that if z is chosen to lie in the left half 
plane (as in part (a) of Fig. 49) —z lies in the right half plane, and vice 
versa. It may also be seen that if z is replaced by its conjugate value 2, 
the factors collectively represent the same set of magnitudes since pairs 
of factors representing conjugate roots merely become interchanged. 
This is, of course, also obvious from the fact that P(Z) is the conjugate 
of P(z), and hence these two values of the polynomial have the same 
magnitude. On the other hand, if the point z is replaced by its image 
about the imaginary axis (this amounts to replacing z by its negative 
conjugate value), collectively the magnitudes of the factors change in the 
same way as they do when z is replaced by —z. Points which are images 


*It is obvious that one may alternatively state that all the coefficients must be negative, 
the significant point being that they all have the same algebraic sign, 
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with respect to the imaginary axis are regarded as corresponding points 
in the left and right half planes. 

It should be clear from the representations in Fig. 49 that for any point 
z in the right half plane, the magnitude of the polynomial is larger than 
it is for the corresponding point in the left half plane. Together with the 
considerations of the preceding paragraph, this fact enables one to see 
without difficulty the truth of the following statements: 


|P(z)| > |P(—z)| for Re(z) > 0 
|P(z)| = |P(—z)| for Rez) =O. [622] 
|P(z)| < |P(—z)| for Re(z) <0 
in which Re denotes “ real part of.” Letting 
P(z) 
P(—3) 
one may alternatively express these results by 
lo(z)| > 1 for Re(z) > 0 
lo(z)| = 1 for Re(z) = 0 [624] 
lo(z)| <1 for Re(z) <0 
It should be clearly recognized that these statements hold only. if 
P(g) is a Hurwitz polynomial, for if P(z) has any zeros in the right half 
plane, points can certainly be found for which these statements collec- 
tively are no longer true. One may, therefore, conclude that if the condi- 
tions 622 or 624 are true, P(z) must be a Hurwitz polynomial. The rela- 
tions 622 or 624 are the necessary and sufficient conditions that a given 
polynomial P(z) have zeros in the left half plane only. By means of the 
succeeding manipulations, these conditions are put into a more usable 


form. 
The first step in this direction is to introduce the function 


o(2) +1 _ P@) + P(-3) 
o(2)—1 P(z) — P(—z) 


According to the discussion given in. Art. 24, this transformation maps 
the interior of the unit circle in the ¢-plane upon the left half of the 
y-plane. Hence one has 


Re(y) > 0 for |d(z)| > 1 
Re(y) = 0 for |[¢(z)| = 1 [626] 
Re(v) <0 for [¢(z)| <1 


$(z) = [623] 


¥(z) = [625] 
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and, with the use of the relations 624, one obtains 
Re(y) > 0 for Re(z) > 0 
Re(v) =0 for Re(z) = 0 [627] 
Re(v) <0 for Re(z) <0 
If the polynomial P(z) is written in the form 
P(z) = m(z) + n(z) [628] 


in which m(z) represents the terms involving even powers of z (called 
the even part of P) and n(z) represents the terms involving odd powers 
of z (called the odd part of P), according to Eq. 625, one has 


m(s) 
ve) =" 
It is thus established that if m(z) and n(g) are the even and odd parts 
respectively of a Hurwitz polynomial, the rational function 629 has the 
properties expressed by the conditions 627, and, conversely, if the ratio 
of the even and odd parts of a given polynomial yields a rational function 
having the properties 627, that polynomial must be a Hurwitz polynomial. 

These properties are now examined in greater detail. Suppose the 
rational function ¥(z) has a pole of the order s at some point 2 = 2). 
The Laurent series for ¥(z) in this vicinity then reads 


b_ 


[629] 


1 
¥(2) = oa ny Ba paar + bo tbie~e%)++++ [630] 
For points very close to z, one may write 
bis 
¥(z) = @oay [631] 
Letting 
b_, = he? [632] 
and : 
(2 — 2) = pe [633] 
one has 
¥(s) & 5 ef (B-sa) [634] 
whence 
Re(y) & ; cos (sa — 8) [635] 


It is thus seen that, in the immediate vicinity of the pole, the real part 
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of ¥(z) assumes large negative as well as large positive values. More 
specifically, as a is allowed to vary from 0 to 27 (the vicinity of the pole 
is explored through passing around it on a concentric circle of small 
radius), the real part of y(z) is observed to change sign 2s times. 

With reference to the conditions 627, one is forced to conclude im- 
mediately that the function ¥(z) cannot have poles in either the right or 
the left half plane. This fact restricts the poles of ¥(z) to lie along the 
imaginary axis of the z-plane. But the conditions 627 together with the 
Eq. 635 impose further restrictions upon these poles. The only conditions 
under which Eq. 635 for a pole on the imaginary axis does not conflict 
with the restrictions 627 are that 6 = 0 and s = 1, for then Eq. 635 reads 


Rey) & : COS & [636] 
According to Eq. 633, Re(z) > 0 corresponds to —2/2 <a < 1/2, and 
Re(z) <0 corresponds to 1/2 <a < 3/2, whereas for Re(z) = 0, 
a = +7/2. Equation 636 is thus seen to yield a real part of ¥(z), which 
behaves in agreement with the conditions 627. The restriction that 
s = 1 means that the pole must be simple, and 6 = 0 requires that the 
residue of ¥(z) in this pole be real and positive. 

The conditions 627, therefore, require that the rational function y(z) 
have poles on the imaginary axis only and that these poles be simple and 
have positive real residues. In order to see that these requirements on the 
function ¥(z) are also sufficient to assure the fulfillment of the conditions 
627, one need merely regard a typical term in the partial fraction expan- 
sion of ¥(z). Such a term reads 

ky 
(2 — 2) 


Since the residue k, is real and positive, and z, is a pure imaginary quan- 
tity, it is evident that the term 637 has the properties demanded by the 
conditions 627, and hence the finite sum of such terms which represents 
y¥(z) has these properties. 

One has thus gained a new formulation for the necessary and sufficient 
conditions that P(z) be a Hurwitz polynomial. Namely, the quotient of . 
its even and odd parts must be a function having simple poles on the 
imaginary axis only, and with positive real residues in these poles. 

It is collaterally useful to digress for a moment and study somewhat 
more carefully the properties of the function #(z). First it should be 
observed that if ¥(z) satisfies the conditions 627, its reciprocal 1/y(z) 
does so also. Hence, using Eq. 629, the function 


1 nz) 


vz) -m(z) 


[637] 


[638] 
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must also have simple poles on the imaginary axis only and positive 
real residues. Both the even and odd parts m(z) and m(z) must, therefore, 
be polynomials whose zeros are simple and lie on the imaginary axis. 
Furthermore, since poles of ¥(z) or 1/y(z) which may lie at z = 0 or at 
z= © must also be simple, it follows that the highest powers of m(z) 
and n(z) as well as their lowest powers may not differ by more than 
unity. They obviously must differ at least by unity because m(z) and 
n(z) are respectively even and odd. 
If one writes 


¥(2) = u(x,y) + joy) [639] 
the conditions 627 are seen to yield 
= >0 forx = 0 [640] 


The Cauchy-Riemann Eq. 12 then shows that 


ov 

wo forx = 0° [641] 
But by Eqs. 627, « = 0 for x = 0, so that this result may alternatively 
be written 


i 
j ay 


which states that along the imaginary axis (where y has pure imaginary 
values only) y¥ is a continuously increasing function. The interesting 
consequence of this fact is that the zeros and poles of ¥(z), which lie 
along the imaginary axis, must alternate. 

If the polynomials m(z) and (z) are written in their factored forms, 
the expression 629 for ¥(z) reads 


>0 forx =0 [642] 


Gn (a? — 217) (2? = 237) ++ + (2? = 272n~1) 
¥(z) = Gn18(2" — Bo") (22 — 247) -- + (2? — 27 on 2) [643] 


in which it is arbitrarily assumed that the polynomial P(z) is of the 
even degree 2”. The alternation of zeros and poles along the i imaginary. 
‘axis is expressed by the conditions 


0< |z1| < |z0| Sees [Zon—2[ < [zon—a| <0 [644] 


This result is referred to as the separation BropetY of the zeros and 


poles of ¥(z). 
Using Eq. 220 for the evaluation of the residues of ¥(z) in any one of 
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its poles at 2 = 29, 24, - + + Zan_2, one has 


ky = E(z = 2y)0 (2) Je=s 


_ Gn (Zy” — 217) (By? — 237) +++ (By? — Bont) [645] 
. On—123y" (2? —Z5”) so Se (z,? —27,_») (27-2749) soe (2,7 —272n 2) 


On the assumption that a,/dn_; is positive, it may be seen from this 
result that the separation property expressed by the inequalities 644 
assures the positiveness of the residues of ¥(z) in all its poles at finite 
frequencies. The residue of ¥(z) at z = ©, incidentally, is recognized to 
be a,/a@n_1, whereas the one at z = 0 has the value of a(z) forz = 0, 
which is positive since all the quantities —z,? are positive. 

In view of these further detailed results it may be stated that if P(z) 
is a polynomial with positive coefficients, and if its even and odd parts 
m(z) and n(z) differ in their highest and in their lowest powers by no 
more than unity and have simple zeros restricted to the imaginary axis 
where they mutually separate each other, P(z) is a Hurwitz polynomial. 

The purpose in thus stating in a variety of forms, the necessary and 
sufficient conditions that P(z) be a Hurwitz polynomial is to focus atten- 
tion upon properties of these polynomials which frequently become 
collaterally useful. The actual process of making such a test may be 
based directly upon any one of the sets of conditions already stated. 
A particularly effective procedure, however, is derived from these con- 
siderations in the following manner. 

For any given polynomial, the function ¥(z) is readily formed accord- 
ing to Eq. 629. If P(z) is of an even degree, ¥(z) has a pole at infinity; 
otherwise 1//(z) has a pole at infinity. Whichever may be the case, one 
begins the procedure by considering that function (y or 1/) which does 
have a pole at infinity. Without any loss to the ensuing argument, this 
function is assumed to be ¥(z). More completely represented, it has 
the form 
= OnZ" ~F On—22" 7 + +++ + O22” + ao 
¥(2 =, On—12" +. Qn—32" 2 + Deg + QZ 


The first step in the procedure is to divide the denominator polynomial 
into the numerator polynomial by the common process of long division, 
however, ceasing after only a single term in the quotient is determined. 
This yields 


[646] 


1 on—2 , ont I 42 ' 
Qn @ n-2o2" * + One” +--+ + Go" +6 
¥(@) = Qn—1 a On 12") + Ange"? + +++ + ays [647] 
which may be indicated more compactly as 
, : 
(ae) [648] 


@n—1 n(z) 
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If P(z), from which y(z) is derived, is a Hurwitz polynomial, ¥(z) must 
have simple poles on the imaginary axis with positive residues. In partic- 
ular the pole at infinity, represented by the first term in Eq. 648, must 
yield a positive residue. Hence one has the first particular condition 


an 


>0 [649] 
Qn-1 

If ¥(z) is imagined to be represented by its partial fraction expansion, 
one may identify the first term in Eq. 648 with that term in this expan- 
sion which represents the pole at infinity. The remaining terms in this 
partial fraction expansion are then seen to represent the corresponding 
expansion for the function given by the second term in Eq. 648. This 
function is the remainder after the pole at infinity is removed from (gz). 
It thus becomes clear that this remainder function must have the same 
properties as ¥(z), namely, simple poles restricted to the imaginary axis 
and positive real residues, and its reciprocal must likewise have these 
properties. 

Denoting the reciprocal of the remainder function by ¥*(g), one has, 


according to Eqs. 647 and 648, 
V(2) = On 18") + On_g2"? + +++ + 48 
a yf? + an ae +--+ +022 + 0’9 


[650] 


This function evidently again has a simple pole at infinity. Repeating the 
process of long division as before, one obtains 


, n—3 , n—5 , 
wig) = Sit gi Co ee ee 
v (z) es @' 4-28" + ET a + niaiecste a’ 92" as ao [651] 
or more compactly 
, 
*(y) — nol n' (2) 
y*(z) Was 2+ m'(2) [652] 


The positiveness of the residue of y*(z) in its pole at infinity requires 
the second particular condition 


i [653] 


The second term in Eq. 652 is a subsequent remainder function which 
again must have the same properties as ¥(z), and its reciprocal must 
also have these properties. The inverted remainder function 


yr@ = 2 


n’ (2) 


again has a simple pole at infinity with a residue which must be positive. 


[654] 
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The continuation of the process is thus clear, and leads successively to 
additional particular conditions like the ones expressed by the inequali- 
ties 649 and 653. The procedure terminates after all the terms in m(z) 
and n(z) are exhausted. 


Letting 
an 
QS 
an—1 
an— 
igo [655] 
a n-2 
, 
@ n—2 
a> 
@ n-3 


ee 


one obtains finally a representation for ¥(z) of the form 


v(s) at : 
oda 
OT ag 


1 

+ one [656] 
which is referred to as a finite S#zeléjes continued fraction.* It contains 
altogether ” terms, as is clear from the following series of fractions indi- 
cating merely the degrees of the numerator and denominator polynomials 
appearing in the original function y(z) and in the successively en- 
countered inverted remainder functions y*(z), ¥**(z), etc. 

n n—1 n—2 2 1 


hea n—2 aa _ 1 0 [657] 


The necessary and sufficient conditions that P(z) be a Hurwitz poly- 
nomial are now simply expressed by the statement that all the quantities 
01, &, ag, etc., as given by Eqs. 655 must be positive. 

This procedure for testing a given polynomial may be replaced by a 
complementary one in which terms representing poles at z = 0 are 
removed from y¥(z) and the successive inverted remainder functions. 
One may say that this variation in the method amounts merely to re- 
placing z by 1/z and proceeding as discussed above. The test is thus 
applied to the polynomial P(1/z) instead of to P(z). Since the transforma- 


*This is one having the form of Eq. 656 in which all the coefficients are positivereal numbers. 
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tion s > 1/z maps the left half plane upon itself, it is clear that P(z) is 
proved to be a Hurwitz polynomial if it can be shown that P (1/z) is such a 
one. 

To carry out this variation in the procedure one begins again with the 
expression 646 for ¥(z) but with the polynomials turned end-for-end. 
The first step in the process of dividing the denominator into the numer- 
ator yields 


_ fo Oe a et 
¥(z) = a2 + ayetees + dy18" [658] 
The first particular condition reads 
<> 0 [659] 
ay 


The inverted remainder must again have the same properties as ¥(z) 
because the second term in Eq. 658 is equal to the partial fraction ex- 
pansion of ¥(z) minus the term for the pole at z = 0. This second term, 
therefore, has the stated properties and so does its reciprocal. The 
inverted remainder, moreover, again has a simple pole at z = 0, and the 
requirement that the residue in this pole be positive yields the second 
particular condition 


=r >0 
as > [660] 


and so forth. 
One thus arrives at the alternate finite Stieltjes continued fraction 


1 
v(z) = fiz* += 


Boe? + 
i 
Sat [661] 
in which 
fi, = eet 
ay 
Bo = o 8 
a als [662] 
q’’ 
Ba = 


eerevrere 
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The necessary and sufficient conditions that P(z) be a Hurwitz poly- 
nomial are that all the coefficients 6, - - - 8, be positive. These conditions 
are entirely equivalent to the ones involving the coefficients a1 --- an 
given by Eqs. 655, although they are different in their detailed appearance. 
It should be clear, however, that the quantities 6, - - - 8, as functions of 
the coefficients do, a1, +--+ ad, of the given polynomial P(z) must be the 
same as the expressions for a; -- - a, in terms of these coefficients except 
that the consecutive order of the subscripts 0, 1, 2, --- on the a’s is 
inverted. In other words, if in the expressions for a1 - + - a, one makes the 
substitutions indicated by 


we ener eee [663] 


the corresponding ones for 8; ++ - 8, are obtained. 

In order to determine the expressions for the a’s or 6’s in terms of the 
coefficients ag --+@, of the polynomial P(z) from the results expressed 
by Eqs. 655 ai 662, it is necessary to carry through the indicated 
processes of long division involved in the steps leading to the continued 
fraction expansions 656 and 661, so as to obtain the coefficients 
a9 +++ Gn 2,4 '9+++ ay 4, etc., as functions of do - - + dy. If this is done 
for the 6’s, one finds that the results obtained for the first three steps 
(that is, for the determination of 81, 82, 83) yield conditions which may be 
expressed in the form 


Di =a >0 
Dz = ot, Ao >O0 

a3 ae 

Be ag. 0 [664] 
Ds =/@3 de ad > 0 

ap a4 a 


with the condition ap > 0 tacitly understood. From the structure of these 
determinants and the recurrent nature of the process of derivation, one 
may assume that the determinant of mth order has the form 


a a O tee 0 
az a2 4, A O Car = 0 


Dn =/!d5 d4 43 Gg Ay UW 0-:--0 [665] 
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The correctness of this generalization may be verified through showing 
that it is consistent with the process of deriving an wih degree Hurwitz 
polynomial from one of the degree » — 1. Suppose the latter is written 
in the form 

Pra = 048") + aga”? + +++ + dn_1Z + dn [666] 
in which the order of the coefficients is reversed, so that the expressions 
for 8 in the former polynomial become identical with those for a in the 
present one. The corresponding function ¥(z) reads 


jj aga" + sss + dn 22” + dn 
ee cae ee Oe ce Ceara ony 
Since this function has a zero at infinity, the one associated with a 
Hurwitz polynomial of the mth degree may be formed by adding to the 
expression 667 a term representing a simple pole at infinity with a posi- 
tive residue. This function may, therefore, be written 
Yn = Vn—1 + Mz [668] 

with the condition that a) > 0. Using Eq. 667, one finds 
r= 

A012" + (Apd3+a2)2" 2+ (apa5 +04) 2" 44> + + (ddn—1 tans) 2° +4n 
42") + aga" 3+ + dnt 


[667] 


[669] 
and the associated polynomial reads 


Py, = A012" + a2"! + (aoag + ag)2”-? + agg”3 
+ (aods + a4)2"* + +++ + Gn12+ a, [670] 


From the manner of its derivation, P, is a Hurwitz polynomial if 
P,1 is known to be such a one, and if ap is positive. The criteria which 
assure that P,_; be a Hurwitz polynomial may, according to Eqs. 665 
and 666, be expressed by writing 


ag a 0 0: 
a4 G3 Gd, a: 
age a5 a4 a3: 
Dr_-1 = | ag Gz ag a5: |>0 [671] 


with the understanding that a, > 0 and that one is to consider all 
principal minors of this determinant which are formed by the first, the 
first two, the first three, etc., rows and columns. 
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According to determinant theory it is readily recognized that 


ay 0 0 0 0 

a3 a2 ad 0 0 

ds Q4 a3 Gg a: 

Q7 Qe dg 4 a3: [672] 


Dr = j 
pend Qg ag a7 ag as. 


which is formed from the determinant 671 by the addition of a first. row 
and column as indicated. Since a, > 0, the conditions formed from this 
modified determinant are identical with those formed from 671 except 
that one must begin with the consideration of two rows and columns in 
order to obtain successively the same conditions as before. 

The first, third, fifth, etc., columns in the determinant 672 are now 
multiplied by the positive constant a9, and the determinant is then 
modified in form (although not in value) by adding to the elements of the 
second column the corresponding ones of the first column, and to the 
elements of the fourth column the corresponding ones of the third column, 
and so forth. This yields 


aoa : Aga 0 0 

Qptg (Apa3 + 42) aod QoQ 

ods (doG5 +44) 903 (pag + a2) ¢ 

Qpdz (dod7 + dg) Ad5 (dod5 + 4) ; [673] 


If the constant a is now imagined to be factored out of the first, third, 
etc., columns, one observes that the result is a determinant D,, from which 
the criteria for the polynomial 670 are formed in the same manner that 
the criteria for the polynomial 666 are formed from the determinant 671. 
Thus, by the method of induction, the correctness of the relations 664 
and 665 for the desired criteria (these are the Hurwitz criteria) is es- 
tablished. 

Routh, who first derived these criteria* (although he stated them in a 
somewhat modified form) made use of a theorem of Cauchy’s, which, for 
the sake of its collateral interest, is briefly discussed in the following 
paragraphs. 

A finite polynomial is written in the form 


P(e) = u(x,y) + jo(a,y) [674] 
‘*Adams Prize Essay, 1877; Rigid Dynamics, paragraph 290. 
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_ and the behavior of the quotient 


aay) = [675] 
is observed as the point z = x + jy traverses once around a simple closed 
contour C which does not pass through a zero of P(z). As is shown in 
detail presently, the function g(x,y) is seen to change its algebraic sign 
either by passing continuously through zero or by passing through the 
value «. Considering only the continuous passages through zero, one 
observes the number of changes of sign from plus to minus and the 
number of changes from minus to plus. Denoting these numbers by r and 
s respectively, the theorem of Cauchy states that 14(r — s) is the number 
of zeros of P(z) enclosed by C. 

To prove this theorem it is expedient to write the polynomial in the 
factored form 


P@)=AG=—n)@—%)--@—s) [676] 


in which A is a real constant and is assumed to be positive. Each factor 
may be represented in the polar form 


(2 — ay) = 7,7 [677] 
whence 
P(s8) = A vty tases ty ftir er tim [678] 
Letting : 
A-n-fa+++ tn = R [679] 
and 
61 + 02 +++ +O, = 6 [680] 
one has 
P(z) = Re? = Reos6+jRsin0 [681] 
Reference to Eqs. 674 and 675 then shows that 
g(x,y) = cot 6 [682] 


It remains to see how the angle @ behaves as the variable z = x + jy 
traverses once around a simple closed contour. For a single factor, as 
given by Eq. 677, this behavior is illustrated in Fig: 50. In part (a) of this 
figure the point z = z, is assumed to lie outside the closed contour C, 
whereas in part (0) of the figure the point z = z, lies within the contour. 
It is immediately apparent that the met change in 0, as 2 traverses the con- 
-tour is either zero or 27 according to whether z, lies outside or inside the 
contour. The net change in the resultant angle 6, as given by Eq. 680, is 
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thus seen to be 27, with NV equal to the number of zeros of P(z) enclosed 
by the contour C. Reference to the expression 682 for the quotient ¢ = 
u/v now reveals the truth of the theorem without further difficulty. 

It may be of interest to point out that the behavior of the angle @ can 
be determined alternatively from the theorem given in Art. 19 involving 


(a) (b) 
Fic. 50. The net change of 0, is 0 for (a) but is 2m for (5). 


the number of zeros and poles of a function within a given region. Let the 
function be f(z), and write it in the polar form 


f@) = au [683] 
Then 
6 = Im (Inf) [684] 
in which Jm denotes “ imaginary part of.” Now 
da = Im (?) [685] 
; f 
and hence Eq. 282 yields 
=i -indt Oa = 
do = a9 =Im$ Fay = RW ~ P) [686] 


Thus the net change in angle Aé@ is found to be equal to 27 times the dif- 
ference between the number of zeros and the number of poles of f(z) 
enclosed by the contour C. The polynomial P(z), given by Eqs. 674 or 
676, can have no poles in the finite z-plane, and hence the present result 
agrees with the conclusion reached above. 


27. POSITIVE REAL FUNCTIONS 


A rational function w = f(z) which is real for real values of z, and whose 
real part is positive for all values of 2 with a positive real part, is called a 
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positive real function* (abbreviated p.r. function). These conditions are 
written 


f(z) real for z real 


Re[f(z)] 20 for Re(z) 20 [687] 


Functions of this sort play an important part in electrical network theory, 
and it is therefore of interest to study their properties in some detail. 
Since the function very often is an impedance, it is customarily denoted 
by the letter Z. The independent variable is the complex frequency, 
which is written \ = ¢ + jw. Being a rational function, Z(\) is given 
by the quotient of two polynomials, and since Z (A) is real for real values 
of , these polynomials must have real coefficients. One may write 
PQ) _ wu (o,0) + j01 (0,0) ; 

ZO) = 90) = alow) + oolow) [688] 
in which ~, and #» are the real parts of the polynomials P and Q respec- 
tively, and 21, v2 are the corresponding imaginary parts. Rationalizing, 
one finds , 


Ug + V2 
R — Mite ihe 
e[Z(r)] ut bog [689] 
and similarly 
1 _ UjU2 + V2, 
ee [zon a? + oe [690] 


Since the denominators in the last two expressions must always be posi- 
tive, it is evident that if Z(A) is positive real, its reciprocal is also posi- 
tive real. © 

The poles of Z(A) are the zeros of Q(n). In the vicinity of one of these, 
one may represent Z(A) by the Laurent series 


ee ba _ 
Z(a) = Qo yt tie ee ey ) +--+ [691] 
For points very close to the pole one may write 
b 
Z | eerie, ey 
| (\) = Q- wy [692] 
Letting 
(A — dy) = pe’* [693] 


*It is not necessary that the function be rational, although in most practical problems to 
which the present discussions are relevant the pertinent functions are rational. 
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and 
b_, = kei® [694] 


one obtains for this vicinity 
k 
Re[Z(a)] & 008 (sp — B) [695] 


As the immediate neighborhood of the pole is explored through allowing 
¢ to vary from zero to 27, one observes that the real part of Z(\) changes 
sign 2s times. It is clear, therefore, that Z(\) can have no poles in the 
right half of the \-plane. It follows furthermore that poles may lie on 
the imaginary axis but that such poles must be simple (s = 1) and the 
function Z(\) must there have positive real residues (6 = 0) so that, 
for the immediate vicinity of such a pole, Eq. 695 becomes 


Re[Z(r)] & : se [696] 


which remains positive for —1/2 < ¢ < 1/2, that is, for values of \ in 
the right half plane. 

Since the same conclusions apply also to the reciprocal function 1/Z(a), 
one recognizes that the zeros of the polynomials P(A) and Q(A) must 
have real parts which are not positive. That is, P(A) and Q(A) must be 
Hurwiiz polynomials.* . 

More explicitly, the expression for Z(\) may be written 


— PO) _ ao + ad + aad? + +++ + aad” 
~ OC) bo + Biv + Bed? + +++ +H Dn d™ 


If the degree of P(A) is higher than the degree m of Q(A), then Z(d) 
has a pole at X = o. Since this point may be regarded as lying on the 
imaginary axis, such a pole, if present, must be simple. Similarly, if 
ay ~ 0, the function Z(\) has a simple pole at \ = 0 if bo = 0; it has 
a pole of second order at \ = 0 if d9 = b; = 0, and so forth. Since the 
point X = 0 lies on the imaginary axis, a pole there must also be simple. 
Recognizing that the same conclusions apply to the reciprocal of Z(\), 
one sees that the positive real character of Z(A) imposes the further 
restriction that the lowest as well as the highest powers of the polynomials 
P(A) and Q(A) can differ at most by unity. 

In examining a given function Z(\) for the purpose of determining 
whether or not it is positive real, it is not sufficient to establish that 
P(A) and Q(A) are Hurwitz polynomials and that their lowest as well 
as their highest powers differ by no more than unity. To form the expres- 


Z(A [697] 


*The term Hurwitz polynomial as used in the present article includes polynomials having 
zeros on the imaginary axis. 
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sion 689 for the real part of Z(A) and examine its behavior over the 
entire right half of the \-plane, however, is 4 laborious procedure which 
one would like to avoid. In this regard it is found helpful to make use 
of the theorem (discussed in Art. 21) which states that if a function is 
analytic within and on the boundary of a given region, the maximum 
and minimum values which the real and imaginary parts of that function 
assume on the boundary are maxima and minima for the enclosed region. 
This region, in the present problem, is taken to be the right half of the 
-plane; its boundary is the imaginary axis. 

If Z(A) is a positive real function, it is analytic in the right half plane, 
and if, for the moment, one assumes that Z (\) has no poles on the imagi- 
nary axis, the theorem just cited assures that the smallest value which 
the real part of Z(\) assumes on the imaginary axis must be smaller 
than any value which this real part may have over the entire right half 
plane. Conversely, if Z(A) is analytic in the right half plane and on the 
imaginary axis, and if the real part of Z(A) on the imaginary axis is 
nowhere negative, this real part must remain positive over the entire 
right half plane, and Z(A) must be a positive real function. 

The stipulation that Z(\) be analytic on the imaginary axis may be 
dispensed with, for if Z(\) has poles on the imaginary axis, it is merely 
necessary to modify this boundary by inserting vanishingly small semi- 
circular detours at such poles, so that the resulting boundary avoids 
these points by passing slightly to the right of them. As shown above, 
the requirement that the real part of Z() shall remain positive on a 
small semicircular detour is taken care of by the stipulation that poles 
on the imaginary axis be simple and that the residues of Z (A) at such 
poles may be real and positive. 

The necessary and sufficient conditions that a rational function Z Qa), 
which is real for real values of \, be positive real may thus be stated in 
a form which does not require an investigation of the real part of Z(A) 
over the entire right half plane. Such a statement reads: 


If Z(A) is analytic in the right half plane, and 

if on the imaginary axis this function has only | 

simple poles with positive real residues, Z(n) [698] 
is a positive real function if Re[Z(jw)] 2 0 

for all real values of w. 


In order to form the real part of Z(A) for \ = jw, one may begin with 
the expression 697 and_in each of the two polynomials group the terms 
with even and odd powers respectively. That is, the polynomials are 
written 


P(A) = m(d) + (A) 


OCA) = ma) + 10) 1699] 
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in which m, and m2 are the terms involving even powers of \, whereas 
m, and mg are the terms involving odd powers of i. It is clear that for 
A = jw, m, and mz are real, whereas m, and my are imaginary. A process 
of rationalization applied to Z(A), therefore, yields 


(m, + 11) (m2 — 2) 


Z(A) = 700 
®) (mz s+ Nz) (m2 — N2) [700] 
from which it is clear that* 
: MM2 — NyN2 
Z(joo)] = (72 — 1% 1 
Rel Ge)] ( My” ee Ne? )_. He ] 


The denominator in this expression represents the square of an abso- 
lute value and hence is surely positive. The condition Re[Z(jw)] = 0 is 
thus seen to be expressed by 


(mm, — nyn2) 20, forrA = jo [702] 


If P(A) and Q(A) are assumed to have the same degree (this assump- 
tion does not restrict the generality of the present argument), it is clear 
that (m,m2 — nn2) is a polynomial of the degree » in the variable )?. 
One may, therefore, write 


(mymz — myM2) = A(Ay? — d?) (Ag? — A?) - ++ (An? — 0”) [703] 
or, for } = jo, 
(mym2 — MN2) =A (A? + w*) (Ag? + w”) ives (An? + w”) [704] 


The constant A must evidently be positive if this expression is to be 
positive for all values of w, since it must still be positive for # —> o. The 
»-roots, which are denoted by 17, X2?,--+An2, may be complex as 
well as real, but since the polynomial 703 has real coefficients, any 
complex roots, if present, must occur in conjugate pairs. Such a pair of 
roots leads to a pair of conjugate complex factors in the expression 704, 
and hence yields a resultant factor which is the square of an absolute 


value. Complex as well as positive real »?-roots, therefore, yield © 


factors in the expression 704 which are surely positive for all real 
values of w. This statement is still true if some of the real \?-roots are 
zero. A negative real \?-root of even multiplicity leads to a factor in 
the expression 704 which is raised to an even power, and hence such a 
factor is also surely positive. However, if there exists a negative real ?- 
root of odd multiplicity (for example, a simple root of this sort), the ex- 
pression 704 is surely negative over some part of the range 0 < w? < 0, 

’ *The functions m(A)} and n(A) should not be confused with u(o,) and v(o,w) appearing 
in Eqs. 688, 689, and 690. m and become identical with « and jv respectively for \ = jw, 
that is, only for ¢ = 0, 
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Tt thus becomes clear that the necessary and sufficient condition insuring 
Re[Z(jw)] = 0 is simply that, in addition to being positive for” = —~, 
the polynomial (mymz — nyne) shall have no negative real d?-roots of odd 
multiplicity. The fulfillment of this condition together with an assurance 
that Q(d) is a Hurwitz polynomial suffices to prove that Z(A) is a positive 
real function provided Q(A) has no zeros on the imaginary axis. If it has, 
one must also establish that these zeros are simple and that the residues 
of Z(A) there are real and positive. If Z(A) is found to be a positive real 
function, one may be sure that P(A) is, of course, also a Hurwitz poly- 
nomial. 

If Q(A) has zeros on the imaginary axis and P()) does not, it is easier to 
establish the positive real character of the reciprocal function 1/Z(\) 
because the latter then has no poles on the imaginary axis although 
Z(x) does. When 1/Z(A) is found to be positive real, it follows without 
further proof that Z(A) is positive real also. 

A limiting form of the positive real function Z(A) results if its real 
part is identically zero for \ = jw. According to Eq. 701 this situation 
requires the condition 


(mymz — Nz) = 0 [705] 
in which \ need not be restricted to pure imaginary values.* If, for the 


moment, none of the polynomials m1, mz, 1, #2 are considered to be zero, 
one may write the condition 705 in the form 


mM, 
Ng 


ny = [706] 
a representation for the polynomial 1 that is possible only if mmz con- 
tains m2 as a factor. Since m2 + m2 is a Hurwitz polynomial, the parts 
mg and m2 have no common factors (the discussion of the preceding 
article shows that the zeros of m2 and m2 separate each other on the 
imaginary axis). Hence the identity 706 can be fulfilled only if 


M, = M2” and sm ~= Mad? [707] 


the factor \” arising from the observation that one may multiply nu- 
merator and denominator on the right-hand side of the identity 706 by 
any power of \. The functions m and m being respectively even and odd, 
it is clear that the integer p is odd. 

The results 707 now yield for the impedance 


(mz + M2)” = 


AUh)is Mz + Ne 


»P [708] 

*It should not be inferred from Eq. 701 that, if mm — nino vanishes for all values of A, 
so does the real part of Z(A), for the expression 701 represents the real part of Z(A) only for 
X = jw, not for A-values in the complex plane. 
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in which one can insert a constant multiplier if desired. Since a pole of 
Z(d) at infinity must be simple, one has p = 1. The function Z(A) thus 
obtained is trivially simple. 

The only other conclusions permitted by the identity 705, if Z(A) is 
not to become identically zero or identically infinite, read 


m=Q0 and m=0 [709] 
or 
m=0 and m=0 [710] 
Correspondingly the impedance becomes 
ny (x) 
ZA) = 1 
0) mz (X) [711] 
or 
my (dr) 
ZA) = 


According to the discussion of Hurwitz polynomials given in the pre- 
ceding article it is seen that these functions Z(\) have simple zeros and 
poles restricted to the imaginary axis. Thus the special form of a positive 
real Z(d)-function whose real part is identically zero for \ = jw is the 
same as that special function whose poles are restricted to the imaginary 
axis. One may say that if Z(A) is a positive real function whose real part 
is identically zero for \ = jw, the zeros of both P(X) and Q(A) must lie on 
the imaginary axis. 

Since Z(A) is by definition a positive real function, the residues in all 
its poles are real and positive. A partial fraction expansion of such a 
Z(a)-function reads 

ko , ke ke kg ke 

east le bee te ao 

The terms in this expression, except for the first and last, are pairs of 

conjugates. Since the poles are restricted to the imaginary axis, the conju- 

gate of A, is —),; and since the residues are real, conjugate poles (which 

normally involve conjugate residues because Z() must be real for real 

\’s) yield identical residues. The first and last terms in Eq. 713 represent 
possible poles at \ = O and A = &. 

Combining conjugate complex terms in the expression 713 gives 


+: 7 -+Ropd [713] 


seb Ropd [714] 


Since 
y= Ajoy and ye? = —w,? [715] 
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Z( je) eee 


k Qkejw . ‘ 
= 8 tt eapfo [716] 
wg — WwW 


0 
je + w” 
This special form of the function Z(jw) is of practical interest because it 
represents the impedance of a lossless network. The function Z(\) in this 
case is evidently on the borderline with respect to the property of being 
positive real. 
In terms of the factored forms of the polynomials P(A) and Q(A), this 
function Z(jw) has a representation of the form 


H(w*, — w”) (ws aa w*) oes (w7op—1 — w?) 


jo(we = w”) (w4 > w”) tee (w*2p—2 -* w”) 


752 


Z(jw) = [717] 
in which H is a positive real constant. 

Even in this special limiting case the real part of Z(A) is zero only on 
the imaginary axis of the \-plane. For complex \-values one has 


Z(d) = R(ow) + JX (ow) [718] 


in which R and X are the real and imaginary parts. According to the 
Cauchy-Riemann Eqs. 12 and 13, 


Ge ua da ee [79] 
Since R(c,w) is identically zero for ¢ = 0 and positive for « > 0, it 
follows that 0R/dc is’ positive and hence that 0X/dw is positive for 
o = 0. In other words, when all the poles of Z(\) lie on the imaginary 
axis and its real part there is identically zero, one has 


(5)... = ae) >0 for—~ <w< [720] 
A similar relation then also holds for the reciprocal function. 

This result states that the real function Z(jw)/j of the real variable 
w has a positive slope for all values of w. As a consequence it follows that 
the zeros and poles of Z(), which are all simple and lie along the imagi- 
nary axis, must mutually separate each other. This alternation of zeros 
and poles may (with reference to the notation in Eq. 717) be expressed by 


0 < wy < we < +++ < Wap-e < Wap-1 < © [721] 


It is significant to observe that a positive real function in this limiting 
case has properties identical with those of the function ¥(z) discussed in 
the previous article (see Eq. 625 and following). A function of this type 
is represented by the reactance or susceptance of a lossless electrical 
network. One may, therefore, state that the polynomials whose ratio 
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represents the reactance or the susceptance of a lossless network are the 
even and odd parts of a Hurwitz polynomial. The converse of this state- 
ment is likewise true. When the expression 688 represents such a re- 
_actance or susceptance function, one of the polynomials P(A) or Q(A) is an 
even and the other an odd function of \, as shown by Eqs. 711 and 712. 
Returning to the general case, one may discover additional useful 
properties of the positive real function by considering its linear fractional 
transformation 


1-20) _ QQ) = PQ) 
1+ZQ) QQ) + PQ) 


As shown in Art. 24, this transformation maps the right half of the 
Z-plane upon the interior of the unit circle in the z-plane (and vice 
versa), the imaginary axis of the Z-plane becoming identified with the 
unit circle in the z-plane. If Z(A) is a positive real function, points in the 
right half of the \-plane correspond to points in the right half of the 
Z-plane and hence to points within the unit circle of the z-plane. One 
may state 


If Z(d) is positive real, |z(d)| S 1 for Re(d) 2 0 [723] 
It is readily appreciated that the converse must be true also; that is, 
If |z(a)| S 1 for Re(a) = O, then Z(d) ts positive real [724] 


Moreover, if Z(A) is positive real, then z(A) must be analytic in the right 
half plane and on the imaginary axis, for 1 + Z(d) cannot be zero there. 
According to the principle of the maximum modulus (discussed in Art. 
21), the largest value which |z(A)| assumes on the imaginary axis of the 
\-plane must be a maximum for the entire right half of that plane. 
Conversely, one may state that if z(A) in Eq. 722 can be shown to be 
analytic in the right half plane inclusive of the imaginary axis, and if on 
this boundary \2(a)| <1, this inequality must hold for the entire right 
half plane and, according to the statement 724,.Z (A) must be positive real. 
It is now interesting to consider the function 


2(\) = [722] 


_ PQ) 
| FN ORN) ve 
in which 

P*(d) = my(d) + meo(d) [726] 


Q*(A) = me(d) + m (A) 


Comparing these relations with Eqs. 699, one observes that the poly- 
~ nomials P*(\) and Q*(A) are formed from P(x) and Q(A) by interchange 
of their odd parts. In terms of Z*(A) one now forms the expression 


418 FUNCTIONS OF A COMPLEX VARIABLE (Ch. VI 


analogous to 722, namely, 
1—Z*(A) _ Q*A) — PFA) 
1+Z*(4) OFA) + PFA) 
According to Eqs. 699 and 726 it is readily seen that 
O*(A) + P*(A) = QQ) + PQ) [728] 
Since it has already been shown that the polynomial P(A) + Q(A) has 
no zeros in the right half plane or on the imaginary axis when Z (r) is 
positive real, one observes in that event that 2*(d) is analytic in the entire 
tight half plane inclusive of the imaginary axis. 
From the mapping properties of the function 727 it follows that \2*(jw)| 
< 1 for Re[Z*(jw)] 2 0. The latter inequality is found, from Eqs. 725 
and 726, to be expressed by 
(“te — i) a (729) 
A=jJw 


Ms” = ny? ~ 


Z*(\) = [727] 


or, since (mo? — m7”) for \ = jw is the square of an absolute value, one 
has more simply 
(mymz — mn) = 0, for’ = jw - [730] 


This is the condition 702, which is fulfilled if Z(A) is positive real. In 
this event the present argument is thus seen to yield \z*(jw)| < 1, and 
since 2*(d) is analytic in the right half plane inclusive of the imaginary 
axis, the principle of the maximum modulus enables one to conclude that 
|2*(A)| S$ 1 throughout the entire right half plane. According to the 
statement 724, therefore, the function Z*(A) is then seen to be positive 
real. This conclusion is summarized by the statement: 


If the rational function Z(d), given by the 
quotient of polynomials P(X) and Q(A), is 
positive real, the rational function which 
results after the even or odd parts of these 
polynomials are interchanged is also posi~ 
tive real. 


[731] 


Since Z*(a) is positive real, it follows that P*(\) and Q*(A) are Hurwitz 
polynomials. Hence one may state 


If the quotient of polynomials P(A)/Q(A) 

is a positive real function, not only these 

polynomials but also those which result [732] 
from an interchange of their even or odd 

parts are Hurwitz polynomials. 
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Further practically useful results are obtained from an investigation 
of the properties of positive real functions in polar form. The first step 
in this direction is the introduction of the relations 


~-A 1+2 
aN) = a or ae a ae [733] 
and 
_ZO)-B 
~@)= 7) 4B [734] 


The positive real constants A and B have any values satisfying the 
relation 


Z(A) =B | [735] 


I By means of the transformation 733 the interior of the unit circle in 
the z-plane is mapped upon the right half of the A-plane. The positive 
real function Z(A) relates points in the right half \-plane to points in the 
right half Z-plane, and these in turn are mapped upon the interior of the 
unit circle in the w-plane by means of the transformation 734. As a 
consequence of the relation 735 between A and B, the origin in the z-plane 
corresponds to the origin in the w-plane, that is, 


w(0) = 0 [736] 


The function Z and its independent variable \ are now represented in 
the polar forms 


d= pe* and Z = re” [737] 


In terms of the variables thus introduced, Fig. 51 illustrates the identical 
transformations 733 and 734. Part (a) shows how the concentric circles 
(p = constant) and the radial lines (¢ = constant) of the right half 
d-plane appear inside the unit circle of the z-plane, and part (b) similarly © 
illustrates the appearance of the polar representation of Z within the unit 
circle of the w-plane. In each case the unit circle itself represents the 
imaginary axis of the \- or Z-plane, the left-hand point on the circle 
(g = w = —1) corresponding to the origin in either of these planes and 
the right-hand one (z = w = 1) to the point at infinity. 

The imaginary axis of the \-plane thus corresponds to |z| = 1. There- 
fore, if Z(A) is a positive real function, points on the locus |z| = 1 yield 
Z-values which are in the right half or on the imaginary axis of the 
Z-plane and hence within or on the unit circle of the w-plane. That is, 
one may state that 


|w| <1 for |z| =1 [738] 
In view of this result and the condition w(0) = 0,.as expressed by 
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Eq. 736, it is recognized that. Schwarz’s lemma (see Art. 21) enables 
one to make a considerably stronger statement, namely, that 


[w| < |z| for |z| <1 [739] 


in which the equals sign holds only if it holds identically. 

This result is readily translated into an expression involving \ and 
Z(d) since it states, with reference to Fig. 51, that to any concentric 
circle within the unit circle of the z-plane there corresponds a concentric 


Fic. 51. Relevant to the derivation of the properties of a p. r. function expressed 
in polar form. ; 


circle within the unit circle of the w-plane which is at least as small or 
smaller. If one considers the value p = A, which implies no restriction 
since the value of A is arbitrary, one observes by inspection of the figures 
that the following condition obtains: 


lols |o| for lol s 5 [740] 


This result may be written in the alternate form 


larg. Z| < |arg.»| for 0 < arg. nN <5 [741] 


Again the equals sign in the first of these inequalities holds only ifit 
holds identically. 

’ The remarkable part about this result is that although it appears to be 
stronger, it nevertheless is contained in the statement Ref Z(d)] 2 0 for 
Re(d) 2 0, expressing the positive real character of Z(A). Since the 
present result is readily seen to include this statement in terms of the 
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real parts of \ and Z(\), one concludes that the two forms of expressing 
the positive real character of Z(\) are entirely equivalent. The statement 
in terms of the angles of \ and Z(A), just derived, is a translation of the 
one in terms of real parts into its equivalent polar form. 

An additional inequality may be obtained from these considerations. 
With reference to the transformation 733 and the polar representation 
737 for , let p = A. This choice is always possible since the value of A is 
arbitrary. Equation 733 then yields 


e*—-1 | 
2= +1 = 7 tan 2 [742] 
’ The condition 739 may therefore be written 
|w| < tan $ for0 <¢ | [743] 
Using the inverse of the transformation 734, which reads 
Z i+w 
Bo 1l-w [744 
and taking note of Eq. 735 ,one obtains the condition 
1 — tan $ IZa)| 1+ tan? 2 
He < ———— > for0 <¢ <= [745] 
¢ Z(p) 2 
1 + tan 2 1 — tan 2 


For ¢ = +/2 the lower and upper limits in this relation are zero and 
infinity, but for ¢ = 7/4, for example, one has 


IZ()| 
0.414 < Zo) Gi 


The restriction implied by the condition 745 is thus seen to illustrate 
another interesting property of positive real functions. 

A property which is rather obvious, but nevertheless practically useful, 
is expressed by the statement: 


If Z(A) and W(A) are positive real func- 
tions, {(\) = Z(W) is again positive real, 


< 2.414 [746] 


[747] 


In other words, a positive real function of a positive real function is 
also positive real. As an example one may consider the simple positive 
real function 


W(r) = k [748] 
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The above statement asserts that if Z(A) is positive real, Z(1/X) is also 
positive real. Again, one may have 


Wa) =rA+ ; _ [749] 


which is readily recognized as being positive real. Then if Z (A) is positive 
real, one may state at once that Z(A + [1/h]) is likewise positive real. 
Use of the condition 741 enables one to make the further assertion that 
the angle of the function Z(\ + [1/d]) must, for every value of ) in 
the right half plane, be smaller than the angle of Z(a). 


PROBLEMS 
1. Verify that the real and imaginary parts of the function 
e* cos y + je* sin y 
satisfy the Cauchy-Riemann equations. Show that this function must be e* where 
zg=a+sy. 
2. Using this reasoning, show that 
sin x cosh y +7 cos x sinh y = sinz 
cos x cosh y + 7 sin x sinh y = cos z 
3. Let the sphere in stereographic projection be of radius 1, so that its equation is 
| Pty +f — 2 =0 
Show that the point £, 7, ¢ on the sphere corresponds to the point 


2g 2n 
=o y= 

Pot rat 
in the plane ¢ = 0. Show that if 222, = —4, 21 and ze correspond to diametrically 
opposite points. ; 

4, For w = sing = u + jv, sketch the curves « = constant and v = constant in 
the z-plane and verify that they are orthogonal. 

5. Actually integrate z? dz around the following contours, and verify that the results 
are zero: (a) Around the square having vertices at 1+ 7,1—j, -—1 —j, -1 +). 
(b) Around the triangle having vertices at 1 +7, —1 —j, —1 +). 

6. Carry out the integration for (1/z) dz about the following contours: (a) The 
square with vertices at 1, 2,2 +j, 1+ . (b) A circle of radius 1 about the origin as 
center. Why is the result not zero in part (b)? 

7. As an example of analytic continuation, consider the following: (a) Find the 
function represented by 1 + 2 + 2?---. (b) Determine its circle of convergence, and 
find the singularity on the circumference. (c) Determine the series for this function 
about the point zs = —4}. (d) Verify that this series converges for points for which the 
original series diverges, in particular at the point z = — . ‘ 


8. As an example of a function having a natura] boundary, consider 
f@)H1l+e24tet+ Hh tal® yess 


x 
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(a) Show that the radius of convergence is 1. (b) Verify that for every point of the 
‘Pp Ps . * 
form e°*/*/?" & and p being integers, all terms after 2?” will be 1, so that the function is 
singular at such points. (c) Show that these points are dense on the unit circle, in the 
sense that there is no interval free of them. As a consequence the function cannot be 
continued outside. 
9. In deriving the Laurent series for a function, the integral formula for the coef- 

ficients is not practical. The function 

sin Z 

a 

has a pole of order & — 1 at the origin, and its Laurent series would be found by 
dividing the series for sin z by 2”. Using this idea, find the Laurent series for 

COS 72 

(l= 3)? 


about the point z = 1. 
10. In deriving a Laurent series, the method of partial fractions is useful. Fo 
example find the Laurent series for 


Peat reet 
27(1 = 2) 


about z = 0 and z = 1. What are the regions of convergence? 
11. Sketch the lines ~ = constant and» = constant near the origin for the functions 


w=1-+24 and w=1+2° 


which have saddle points there. 

12. Find the value of the integral of csc 2 dz taken around the following contours: 
(a) A circle of radius 1 and center at the origin. (b) A circle of radius 4 and center at 
the origin. (c) A circle of radius 2 and center at z = 2. 

13. Find the integral of sin z/z* for k = 1, 2, 3, and 4 taken about a circle of radius 
1 and center at the origin. 

14. Determine the polar and rectangular forms of the function 

sinh 2z 
f@) = n sinh z 
in which is an integer, and show that the Cauchy-Riemann equations are satisfied. 

15. Show that f(z) in the foregoing problem is an entire function, and determine 
the distribution of its zeros. 

16. Suppose the derivative of f(z) = u(x,y) + jv(x,y) is written 

af =a+jb = J/ a 2 ei tan! v/a 


‘i 


Tf f(z) satisfies the Cauchy-Riemann condition equations, show that one may have 


or a oe d ae 


Ou Ov re ie 
Ox oy oy 


a=5 and 6 = 


For the following functions 
2”; V3; cosh 2; sinh g; e??; Inz; cos! z 


( 
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compute the polar form of d/ ‘/dz through first forming « and v and obtaining aand b 
by one or the other of the above pairs of relations, and alternately through forming 
df/dz directly and then putting the result into its polar form, Check the three results 
for each function. 

17. Derive the following identities: 


(a) sinhjz = 7 sinz ‘ (b) singz = j sinh z 
(c) coshjz = cosz (d) cosjz = —cosh 2 
(e) tanhjz = j tanz (f) tanjz =7 tanh z 
(g) sinh! jz =jsin7!2z (h) sin~! jz =jsinh™!z 
(i) cosh—1z =jcos"!2 (j) cosh7!jz = 7 cos! jz 
(k) tanh7!jz = 7 tan7!z (1) tan7!7z = 7 tanh7)z 
(m) sinh7!z = In (2 + V2? + 1) (n) cosh7!z = In (2 + Vz? — 1) 
1 1+2 1 +1 
ely were. SD pe ge 
(0) tanh s = 5in(5 =) (p) coth~1z 5in(E*S 
(q) sin? = 5 In Ge + VE) (r) costs = iin (e+ V1) 
7 1, /1 =) i 1 é = 
Wig. ees 1d Wegees ee 
(s) tan'z BG a (t) cot7z ria joy 


18. Considering the function w = In z, draw in the w-plane the figure corresponding 
to the rectangle in the z-plane defined by = 
- @=-3; 1Sy82) @W=3; 185982) 
(-3S5283; y=1) (-3852353; y =2) 


19. In a complex plane draw the figure bounded by arcs of concentric circles with 
radiir, = 10 and v2 = 30 centimeters, and radial lines making angles of 30 and 60 de- 
grees with respect to the positive real axis. Using the function w = In z, transform this 
figure into a rectangle and specify the equations of the straight lines forming its sides. 

20. Considering the function w = sin™!z = «+ jv, show that the lines 4 = 
constant and v = constant in the w-plane are transformed respectively into central 
hyperbolas: 

Bre hE, 
sin? cos? % 


"and central ellipses 


ae yee See 
cosh? v © sinh?» 


in the z-plane, and demonstrate that these families are confocal and orthogonal. Draw 
a representative number of these loci. 

21. With reference to the conformal map described in the previous problem, con- 
sider the strip —7/2 < u <7/2 in the w-plane and determine the regions in the 
z-plane corresponding to the portions of this strip defined by » > 0 and v < 0 respec- 
tively. The closed rectangular boundary in the w-plane joining the four points « = 0, 
v|= 0.002; « = 1, » = 0.002; « = 1, » = 0.02; w = 0, v = 0.02 is traversed counter- 
clockwise. Determine the corresponding contour in the z-plane and indicate the 
direction of traversal. 
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22. Continuing the study of the function w = sin! z, describe the structure of its 
Riemann surface pointing out the location of branch points, appropriate positions of 
branch cuts, and the number of leaves in the surface. How many leaves of the Riemann 
surface are occupied by the closed contour in the z-plane corresponding to the rec- 
tangular one in the w-plane joining the four points: 


pe SZ eek Ga et gee 
“= cou ~ "4? ? “=a? 


23. Consider the function w = 1/z and show that it transforms circles in the 
z-plane into circles in the w-plane and vice versa (including straight lines as limiting 
forms of circles). Draw a family of circles in the w-plane corresponding to the straight 
lines y = kx + (with & and c real) in the z-plane, choosing a fixed value for k, for 
example, & = }, and various values for ¢ as, for example, c = 0, +:1, +2, etc. Alter- 
nately consider the families of straight lines « = constant and y = constant. 

24. For the function w = z/(a — 2) with a = 1 +71, map the loci in the z-plane 
corresponding to the linesv = 44u + 1,0 = }4u,v = gu — 1 in the w-plane. Indicate 
corresponding directions of traversal by placing arrows on the loci. By shading indicate 
the regions in the z-plane corresponding to (a) that part of the w-plane below the line 
v = lu, (b) that part of the w-plane below v = }4u and above the real axis. 

25. The arc of a circle of radius 10 and center in the first quarter of the z-plane 
passes through the points x = 0, y = 0 and x = 3, y = —1. Show that the region 
enclosed by this arc and the chord passing through the same two points may be 
mapped in the w-plane as an angular slit with its vertex at the origin by means of the 
transformation w = 2/(a — 2). Determine the complex value of the constant @ and 
calculate the angular aperture of the slit as well as its orientation in the w-plane. 

26. A region A in the first quadrant of the z-plane is bounded by the arcs of three 
circles passing through the origin, two of them having their centers at the points 
x = 1,7 =Oand x = 2, y = 0, whereas the center of the third circle is at the point 
x = 0, = 2. Asecond region B is bounded by the arcs of two circles passing through 
the origin with centers at x = —2,y = Oand atx =0,y = —1. Show that each region 
can be transformed into a rectangular strip in the w-plane through the transformation 
w = 1/z and determine the boundaries of these corresponding rectangular regions. 

27. Show that the equation 


A(x? + y?) + Be +Cy+D=0 
in which the constants are subjected to the condition 
B?+C? —4A4D>0 


Sar 
9210; w= ~-7, 9= 10 


is a general equation for circles and straight lines, and that this property is preserved 
when z = x + jy is replaced by 1/s. 
28. For the transformation 
aes l—3z 
1+2 


let z = re and consider the concentric circles y = constant and the radial lines 
@ = constant, and compute the location of the center and the radius of each corre- 
sponding circle in the w-plane in terms of r and @ respectively. Plot a representative 
family of curves in each plane. 

29, For a function w = f(z) = « +2, satisfying the Cauchy-Riemann equations, 
- Joci for # = constant and» = constant are plotted in the z-plane. If the slopes of these 
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curves are denoted by (dy/dx) —constant and (dy/dx), constant respectively, show that 
at any point these derivatives have negative reciprocal values and hence that the 
families of curves intersect at right angles. 

30. Study the function w = e)-) in the vicinity of the point 29 = 4(¢ + 4) 
through plotting in this region a number of curves corresponding to « = constant and 
»v = constant fora = 5'+j0 and’ = 0 495. 

31. Let », = 0u/dx and v, = 0u/dy be the velocity components of a conservative 
hydrodynamic field, u being the real part of the analytic function w = f(z). Show 
that the magnitude of the velocity is given by v = |df/dz|. 

32. Write down the mutual conditions to be satisfied by two functions P(x,y) and 
Q(x,y) if the line integrals 


if ea - ea) and f Qa +Pa 


are to be independent of the path c, and compare the results with the Cauchy-Riemann 
equations. 


33. Sis a path joining two points A and B in the complex z-plane, and 2), 22,°--, 
Zn, = B are n uniformly spaced points along this path. The definite integral along the 
path is defined as 


n 
f fle) de =limit 2 flex) An, with Asy = 24 — ta 
S tor 


Show first that 


| E flor) Ase 
k=1 


froe|\s felled 


34, With reference to the situation given in Prob. 33, suppose f(z) is bounded along 
S and that its largest absolute value on this path is M. Let the length of the path S 
between A and B be denoted by L. Show that 


| ff tee 


35. Let f(z) be regular on and within a closed contour C. The length of the contour 
is L, 2 is an internal point, and L’ is the circumference of a circle whose radius is the 
minimum distance from z to the contour. By means of Cauchy’s integral formula, 
show that 


SX lyes)|- lanl 


and thus that 


SML 


If@|s a (M defined in Prob. 34) 


36. By direct integration along the sides of a rectangular contour joining the points 
(x,y); (% + x0,y); (% + x0,y + yo); (4,y + yo) check the relation 


fe dz =0 
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37. Consider integration of the function e*/z around a closed contour formed by 
two semicircles in the left half plane concentric at the origin, one having a large radius 
R, the other having a small radius p, and those portions of the imaginary axis joining 
the semicircles. The integral is effectively expressed as the sum of four parts corre- 
sponding to the two linear path increments and the two semicircular ones for which 
z = Re and z = pe. Observing that the function e*/z is regular on and within this 
contour, obtain in the limit R — », p — 0, the result 


foe -5 
o 9 2 


38. Compute the definite integral 
i 
1 i? 


along a path consisting of the linear increment from the point 1 to the point |z| on the 
positive real axis, followed by a circular increment concentric at the origin, expressing 
the result as the sum of two parts corresponding to these path increments. Thus show 
that the value of the integral equals In z. 


39. Consider integration of the function w = e~* around a closed-rectangular 
boundary joining the four points z = —a,z =a,z =a+jb,z2 = —a+ jb, a andb 
having positive real values, and express this integral as the sum of four parts corre- 
sponding to the four linear path increments. Using the result 


f e~? dx =4AVa 
0 


obtain in the limit a — © the more general one 


_3 
f e-*" cos 2hx dx = 2 — Va 
0 


40. Integrate the function e—* around a closed contour formed by the linear incre- 
ment from 0 to R along the positive real axis, followed by the circular arc from R to 
Ref!) and completed through a linear increment from Re’*/® to 0. Considering the 
limit R > « and again using the value of the error integral given in Prob. 39, obtain 


the result 
2 . 1 fr 
2 zh a) cae [5 
f cos x a= f sin «? dx a 


which, with the substitution x? = u, yields the Fresnel integrals. 
41, Expand the following functions in Maclaurin series: e”, cos z, sin z, cosh z, sinh z. 
42, Through expansion in Maclaurin series, check the following expressions: 


2 2% gt 
INitz)=2-D+y-Gt 
- —1)(@ = 2), 
(tay at pep PAD gay MO OD sy. 


43, Using the result that the radius of the convergence circle of the power series 


F (2) = ao + a1 + ae? +++ + ane™ +°°- 
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is given by 


1 
a= imi | 375 | 
show that R = « for the series in Prob. 41 and R = 1 for those in Prob. 42. 

44, Find the radius of convergence of the following geometric series: 1 + 2 + 2? 
‘pe gh feces fg fore, 

45. Expand the function (2? — 1)~? in a power series about the points z = 0, 
2 =j,% = 4, and find the radius of convergence in each case. 

46. Utilizing the fact that the convergence circle reaches to the nearest singularity, 
compute the radius of convergence of the function [(z ~ 1)(2 + 2 — 73) (22 + 9)]~! 
expanded in a power series about each of the points: z = 0, 2 = 0.2 — 73, 2 = 72, 
= —8. 

47, Assuming that the function given in Prob. 46 is expanded in power series about 
points on the line y = —2x lying at x = 1,0, —1, —2, make sketches showing the 
various regions of convergence and their overlapping portions. 

48. Show that the expansion of the function 


fl) = yi * tte dy 


about appropriate points 29 on the positive real axis reads 


f@) = Sane" 


n=0 (Z0 = ijrtt 


and verify that the radius of convergence is R = |zo — 1|. 
49. Using the result = 
1 


fo tee; for x >0 
0 z 


find the values of the following integrals: 


f e-™ cos yt dt de e-™ sin yt dt 
0 0 


50. Show that the function 1/(1 + Vz — 2) has two different power series expan- 
sions about the point z = 0 with radius of convergence equal to 1 and 2 respectively. 

51. Using the formula for the coefficient }, of the Laurent series, and assuming 
that the points ¢ and 29 are not functionally related, obtain the relation 


ee 
nt in +1 030 


applicable for negative as well as positive integers 1. 
52. Consider the rational function 

P 1 (2) 

Paz) 


in which P; and P2 are finite polynomials. Assume that P2(z) has a zero of multiplicity 
a at z = 2 and write Pe(z) = (2 — 29)*Fe(z). Letting ¢ — zo = pe” in the formula 


R@) = 
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for Laurent coefficients and recognizing that p may become arbitrarily small, obtain 
the result 


b at, 180) 
~*  Fe(Z0) 


Also show that the formula yields b_(a+,) = 0 for all positive values of p, and using 
the result of Prob. 51, get 


db este I 
~~ @ = 1)@ — 2)*++ @ — p) dm? * 


which contains the specific result 


1 ge-1 


ce aa @ —1)!oe7 


b_a 


53. Considering the Laurent expansion of the rational function described in Prob. 
52, show that the coefficients b_»* * - b-1 have real values when 2» is real and that they 
have conjugate complex values for conjugate values of zo. 


54, Show that the Laurent expansion of the function 
f(a) = elt/@-UA)) 


about its essential singularity at z = 0 has coefficients given by the formula 
2a 
ba(t) = tf cos (np — tsing) dp 
Qr 0 


which are Bessel functions of the first kind. 


55. Suppose f(z) has an isolated singularity at 2 = 20 and the radius of convergence 
for its Laurent expansion about this point is R. For points on a concentric circle about 
Zp With radius r < R show that the series representation takes the form 


(2) = Ao +X (4a + An) 608 np +J(4n ~ A) sin 6) 
in which 
2a 
An =z ff Slee + relent ap 


56. Find Laurent expansions for the following rational functions about their poles: 


522 + 32 +2 1 32 
4227 + 82 +1 427 + 82+1 427 + 8241 


57, Find the partial fraction expansions of the functions given in Prob. 56. 

58. A polynomial P(z) has a double zero at 2 = 1, a zero at z = j, and a zero at 
z = 3(1 —7). Assume the coefficient of the term z‘ to be unity. Find the Laurent | 
- series for 1/P(z) about 2 = 1. 

59, Compute the residues of the following functions in their various poles: 


52 +241. a2 4 Sells git 


2(2? + 3z + 8) 22 — 3ein/6 z 


60. A rational function f(z) has simple poles at the points 2 = 21, Z2,°** Z, only, 
with the residues 1, k2,- ‘+ &n respectively. Find analytic expressions for the residues 
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of the function 


_ of) 
VO) = FG-5) 


in all its poles assuming that ¢ does not coincide with any of the points 21° ++ Sn. 
61. Continuing Prob. 60, evaluate the contour integral 


1 
— d 
Oni gf W(z) dz 
in-which C is a circle about the origin with a radius R sufficiently large to enclose all 
the poles of y(z). If f(z) is regular at infinity, show that 


g f(z) dz 
c| 2 


and hence that the value of the above contour integral must be zero. Thus obtain the 
following form for the partial fraction expansion of f(z): 


3~0 as R- © 


us 1 1 

J@) = 40) +2 (ee +5 
62. Generalize the result of Prob. 61 to the extent of allowing f(z) to possess an 
unlimited number of isolated simple poles (it is then no longer a rational function, 
of course). Through choosing the circular contour in such a way that ‘no pole lies upon 
this path at any stage in the limiting process indicated by R > ~, conclude that the 
above partial fraction expansion of f(z) yields (in the limit » — ©) a convergent 
series (theorem of Mittag-Leffler). Apply the result to verify the following expressions: 


v 1,3,5+++ 1 1 
py —1)” pas 
COs 7172 ne fen ere 1) : v _ 
, 2 2 
1,3,5++- 1 1 
wtan@w = — +— 
y=—1—3,-5--[ PP 
2 2 


Convert these into summations over positive integers only. 
63. If a rational function has a zero at infinity with a multiplicity of two or more, 
show that the sum of its residues is zero. 


64. Find the values of the contour integrals 


dz 2dz 
h - § ws and - fe 


when the contour C is defined as: (a) a circle of radius R < 5 with center at the origin, 
(b) a circle of radius R > 5 with center at the origin; (c) a triangle with vertices at 
the points ~—6, 76, —j6; (d) a triangle with vertices at the points --6, 76, 710~® 
65. Show that the function 

f@) = : ; 

2 + (—a +b +c)2? — (ab +ac — be)z — abc (a? + ab + ae + bc)(z — a) 
has a removable singularity at the point z = a through demonstrating that (g — a) X 
f(z) is regular in this point. Compute the residues of f(z) in the remaining poles. 
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66.: Considering the function 


f(@) = 


within a circle of radius p about the origin, show that for a nonzero p however small, 
the circle nevertheless encloses an unlimited number of poles. 


67. Starting from the definition of the Tschebyscheff polynomials 


cos ( cos~! g) 


In (2) a Qn-1 


derive the alternate relation 


74() = SHIVER + ~ VI 
£7 = Qn 


and show that these functions are finite polynomials in z. 
68. Find the points of stagnation of the function 


f(@) -»(++%) tins 


in which v, R, and C are real constants, and map their locus in the z-plane as the 
quantity C/(47v0R) varies continuously from — © through zero to «. 

69. For any function w = f(z) = u(x,y) + jv(x,y) satisfying the Cauchy-Riemann 
equations, regard u(x,y) and v(x,y) as representing altitude functions and thus 
defining a pair of surfaces s; and sz. Show that a maximum or minimum in the altitude 
u coincides with a maximum or minimum in the altitude v and with a saddle point of 
the function f(z). 

70. Find the saddle points of the function 


f() = et—avet+e 


in which ¢, a, and 0 are real constants. Show that they always lie in the real or imagi- 
nary axes and that their positions are controlled by the ratio é/a. 

71. Inside a closed contour C the single-valued function f(z) is regular and con- 
tinuous except at poles in the points 41, de, - - - a, with the residues di, bp -- - by. If Lis 
the length of this contour and M is the maximum absolute value of f(z) on the contour 
show that 


72. Consider the rational function 


I@) _ Go + 41% + Gee" 4 +++ + ame 
© By + biz + bag? +++ + bye” 


and let ki, ke, +** Rp (p S 2) be the residues in its poles which may or may not 
be simple. Cis a circle about the origin with radius R large enough to enclose all the 
poles, Points on this circle are denoted by z = Re’. Show: 
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(a) That : 
24 
dulks + ha +°** + hel S rf IT(e)| ae 


(b) If m > (m + 1) that 
| (ki the te++ +hp) =0 
(c) If m = (m +1) that 


(kr tha tess +h) = 


How can one obtain an expression for the sum of the residues when n < (m -+ 1)? 
73. Evaluate the following integrals involving functions of a real variable x 


f © x? dx and f 2 dx 

o xi tl o 68 +1 

through replacing x by the complex variable z = x + jy and using the methods of 
contour integration, choosing as the closed contour the real axis from —R to R and a 
semicircle of radius R which is regarded as having unlimited magnitude. Consider 


whether the semicircle should lie in the upper of lower half plane, and show that the 
contribution of the semicircular path increment to the value of the integral is negligible. 


74. 2 = e” is a point on the unit circle about the origin, and y (cos @, sin 8) is a 
rational function of sin @ and cos 6. Show that 


[ova=§$ foas 


in which f(z) is a rational function of z, and C is the unit circle about the origin. 
75. Using the results of Prob. 74, verify the following integration 
aie dé _ ar 
fo 1+a%—2acos@ |a? —1| 


76. Within a closed contour C the function f(z) has simple poles at the points 
21, 22, °° * Z, With the residues bj, be, +++ b» respectively. Inside the same contour the 
function cot wz has p simple poles. Show that 


Dp n 
PI cotrzdz = LS f(m +k) +r Do by cot rx 
Bc k=l k=1 


in which m is an appropriate integer. 

77. Through replacing x by z, choosing a closed contour consisting of one large 
semicircle of radius R, a small one of radius p (both in the upper or lower half plane) 
confluent with portions of the real axis, and showing that contributions due to integra- 
tion along the semicircular path increments vanish in the limits R -> © and p —0, 
obtain the result 


It is now proposed to find the integral of the same function between the limits 0 and » 
through considering the change of variable x — (—~) in the integration over— © to0, 
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noting that 
[In (—«)]? = (Iinx + jr)? = (nx)? — 9? 4720 Ing 
Thus, after obtaining the collateral results, 


2 dx v ° Ine 
f ite 72 and , tae 8 


show that 


bo too 


78. By means of complex integration, following a pattern similar to that suggested 
in Prob, 77, check the values of the following definite integrals 


f °= cosx di we f ° sing a wet | h 
=—— ———. dx = —— sinha 
oO «2+? 2a 0 x%* +4? 2a 


Express cosz and sinz in terms of the exponential function and correspondingly 
represent each integral as a sum of components. Choose composite contours consisting 
of the real axis and a semicircle of radius R — ~, placing the latter in the upper or 
lower half plane according to the requirement that this path increment shall contribute 
nothing to the value of the contour integral. Note that the second of the two integrals 
above remains proper for a > 0. If the constant a is regarded as real throughout the 
process of evaluation, is the result nevertheless still valid for any complex a-value, 
for example, for a pure imaginary value? 

79. If f(z) is a rational function having a zero at infinity with a multiplicity equal 
to or greater than one, .and if F(¢) is a real function of the real variable ¢ which is zero. 
for ¢ <0 and has the property that F(t)e~** +0 for {> » (assuming z to have a 
positive real part), the following pair of mutual integral relations hold: 


Fi) = af. f@e4dz f(z) = fi roce dt 


Check these relations for the following pairs of functions: 


f@) = oF F(t) = ite7* 
{@) =a = Z F(t) = sin at 
f®™ =-are@ ms 7 F(t) = cos at 
fi) = ear F(t) = e~* cos bt 


The first of the two integrals is to be evaluated using complex integration along a 
closed contour consisting of the imaginary axis and a large semicircle of radius R — 
lying in the left or right half plane according to whether ¢ > 0 or ¢ < 0 (see Art. 26 of 
Ch. VII). The second integral in the above pair is to be evaluated according to the 
methods of real integration. 

80. Expand the following rational functions in partial fractions and check the 
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relations regarding the residues as stated in parts (b) and (c) of Prob. 72: 
, (a) 524(22 + 22 — 1) (b) a(22 + 22 — 1) 
(22 — 2)(22? —z — 1) (zg — 3)?(2 + 8) 
2? +22~1 1 
© Gaye +8) @) Gaye 5) 


81. Discuss the structure of the Riemann surface associated with each of the 
following functions: 


22 —3 
Zz 


(a) w= Ws (b) 


()w=V@-ae—b? (4) w=V@—5)@ + 6 + 13) 


pointing out the location and order of branch points and other singularities as well as 
the character of the point z = ~. 

82. Discuss the Riemann surfaces in both the w- and z-planes associated with the 
function w = 23/5, 


83. For the function w = ~/z consider the leaves of the Riemann surface defined 
by the statements: 
—m < arg. <7 corresponds to leaf I. 
a < arg. 2 < 3m corresponds to leaf IT. 
3x < arg. 2 < 5x corresponds to leaf III. 
In the w-plane plot loci corresponding to the following straight lines in the z-plane: 
(a) Parallel to the real axis at distances 0.1 and —0.1 from the origin and lying in 
’ leaf I. ; 
(b) Parallel to the imaginary axis at the distance 0.1 from the origin and lying in 
leaf III. 
(c) Parallel to the imaginary axis at the distance —0.1 from the origin and lying 
in leaves I and II. 
84. For each of the fqllowing functions w = f(z) construct the algebraic equation 
F(z,w) = 0, which generates the complete function, and from this equation determine 
the inverse function z = ¢(w). 


@)w=V2—-1 (b)w=24+VP 41 (w= its 


Describe in each case the structure of the Riemann surfaces in the w- and z-planes, 
pointing out the locations and multiplicities of branch points, branch cuts, etc. 


85. Recognizing that each of the complete functions w = +/z and W = /z —1 
has two branches, construct the algebraic equation that generates the function U = 
w+ W. Determine this function as well as its inverse z = ¢(U), and discuss com- 
pletely the Riemann surfaces in the U- and z-planes. 

86. The expressions 


w=/2 and w=vV/i—sin?z 


represent pairs of single-valued functions rather than multivalued functions. Demon- 
strate the truth of this statement through showing that the.two functions of a pair 
cannot be obtained one from the other by the process of analytic continuation. 
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87. Through the use of Hilbert transforms solve the following: 


: Ji for-yw<y<n 
(a) given u(0,y) = (0 for lol 9% find v(0,y) 
0 for -y<y <1 


Bye HOE foe lulose a 


find 2(0,y) 


(c) given w(Oy) =), ~2 forO<y<y, find 0(0,y) 
1 


142 for -y1 < yy <0 
yi 
0 for |y| > 91 


M1 


ay find »(0,y) 


forO<y <1 


(d) given «(0,y) = 
YL 


: for —y1 < y <0 


a for |y| > 91 


6 
ies for ~n-S<y can $5 


_ 6 

-G— for 5 <9 <n +5 

(e) given u(0,y) = . 3 find 0(0,y) 
1 for ly| <1 — 5 


6 
0 for \y| > +5 


eg for —y1<y <1 
1 


(f) given 909) = J ne fory < —y1 find (0,9) 


nx fory > 41 


—_—_——_—. 
Xe 


CHAPTER VII 


Fourier Series and Integrals 


1. FINITE TRIGONOMETRIC POLYNOMIALS 


In discussing the convergence of Fourier series it is necessary to have 
compact expressions for their partial sums. For this reason, and also 
because one finds expressions of this sort useful in various other problems 
having to do with trigonometric series, a number of formulas are de- 
veloped whereby a variety of finite trigonometric polynomials are given 
in closed form. 

In view of the well-known geometric series 


1 E 
fog TR te Pe ee [1] 
and 
gh 
Bat pate gamit [2] 
one may write 
qT — gntl 
ae bee he [3] 
and 
— ght 


By addition and subtraction respectively these equations yield 


1 — grt ‘ 
— =e (l+2%) = 21 +d@+27) +39@ +27) +004 
3(2" + 2)} [5] 
and 
1 — 3") 1 —1 1/,2 —2 1 
a a) = 2) + Ee) Fe 9) [6] 
Letting 
z= e* = cosx+jsinx [7] 
and noting that 
zg” = e™* = cos nx +7 sin nx [8] 


£56 
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one finds for example, that 


gntl gintb/2 ie g tl) /2 fg 
1-32 a a as gil? g-ll2 (ar + a!) 


2 sin (n +1) = cosn” : 
= 2 2 
a Les D] 
sin. 2 
By use of well-known trigonometric identities, this may further be 
transformed as shown by 


2 sin (n + 1) 5 cos ns sin (2” + 1) 7 
= = —_____“1{ [10] 
_ & . «x 
sm 2 sin. 2 
The use of manipulations of this sort enables one to obtain from Eqs. 
5 and 6 the formulas 


sin (2n + 1)5 


+ ———— = 1 + cos + cos 2x + +++ + cosme [11] 
2sin5 


DO] pe 


and 


cos (2” ++ 1)5 
9 Oot 5 — > Sin # + sin 2x + +++ + sin mx [12] 
2 sin = 

2 
The compact forms thus obtained for the finite trigonometric polynomials 
given by the right-hand sides of Eqs. 11 and 12 are useful in a variety of 
practical problems as well as in connection with various theoretical dis- 
cussions regarding Fourier series. Other formulas are readily obtained 
from Eqs. 11 and 12. For example, replacing the variable x in Eq. 11 
by (« + 2) yields 


(-1)" cos (2n + 1)5 
+ shes _ = 1 — cose t cos 24 ~— cos 34 +--+ 


x 
2 eS 


Nie 


+ (—1)" cos nx [13] 
which is the same polynomial but with alternating signs. Making the same 
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change of variable in Eq. 12 gives 


(=1)" sin (2m + 1) 5 
= tan = +} ———_—— = sin x — sin 2x 4+ sin 34 — --- + 
2 2 x 
2 cOS>5 
2 
(—1)”") sin nx [14] 
Subtracting Eq. 13 for 2 odd from Eq. 11 yields a formula for a cosine 
polynomial with only odd integer multiples of x, thus: 


sin (7 + 1)x 


- = cosx + cos 3x-+cosSa++->++cosnux [15] 
2 sin x 
Similarly, adding Eq. 12 and Eq. 14 for n odd yields 


1 — cos (n + 1)x 


- = sing + sin 3x + sin5x+---+sin nx [16] 
2 sin x 


in which sin? (n + 1)5 may be written instead of $[1 — cos (w + 1)x] 
if desired. 
In Eq. 15 one may introduce the change of variable x ->«% + 5 and 


obtain a sine polynomial with odd multiples of « and alternating signs, 
thus: 


(—1)@~Y?? sin (w + 1)% 


= sin x — sin 3x -+ sin 5% —-:-- 
2 cos x 


+ (1)? sin nx [17] 
Making the same change of variable in Eq. 16 yields 


1+ (-1)%-? cos (n + 1)x 


= cos x — cos 3x + cos 5x ~- - +> 
2 cos ¥ 


+ (-1)%"?!? cos nx [18] 


In these last two transformations it is significant to note that Eqs. 15 
and 16 apply only to odd integer values of 1, so that (m + 1) isan even 


integer. Hence sin (#7 + 1) 5 = 0; whence 


cos (n+1)(x+ *) = cos (1 -+1)x cos (n+1)5 —sin (n+1)xsin (n+1)5 
= (—1)@*)?2 cos (n+1)x = —(—1)” cos (nt+1)x [19] 
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and 


sin n+1)(2+3) = sin (n +1) cos (n+1)5 + cos (w +1) sin (w+1)5 
= (-1)@? sin (n+1)x = —(—-1)°-?? sin (n+1)x [20] 


2. THE ORTHOGONALITY RELATIONS AND THEIR SIGNIFICANCE IN THE 
EXPANSION OF ARBITRARY FUNCTIONS 


The trigonometric functions, in common with many other kinds of 
so-called systems of proper functions, possess a very interesting and 
important property which greatly facilitates the process of represent- 
ing an arbitrary function over a given interval by a series in terms 
of proper functions. Since the underlying principles of this process have 
a broad significance in the solution to problems in potential theory 
and wave motion, a few rather general introductory remarks are in 
order. 

For detailed discussion of the derivation of the following fundamental 
differential equations and for physical interpretation of them, the reader 
is referred to other portions of this reference series, the chief interest 
at the moment being focused primarily upon their purely mathematical 
import. In dealing with problems in potential theory, in which a desired 
potential function is a function of the space co-ordinates (x,y,z) alone, 
Laplace’s equation 


v7V =0 [21] 


is found to govern the behavior of that function, whereas in problems 
involving wave motion, in which the time co-ordinate also is involved, 
this equation is modified by the appearance of an additional term so that 
it reads 
ayy — 1 oD _ 
v 0 Cc or? i 0 [22] 
In the first of these two cases the potential is static, whereas in the 
second it is dynamic — that is, it varies with the time as well as with the 
space co-ordinates. In other words, a problem in wave motion is simply 
a problem in potential theory with an added time co-ordinate. 
The form of the Laplacian operator V? depends upon the particular 
system of space co-ordinates used.* In the commonest of these, the 


*See Art. 18, Ch. V. 
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ordinary rectangular Cartesian system, it is given by 
a? 3 
ssp tapt+s 23 
a2? [23] 


In dealing with the so-called wave equation 22, the first step is to 
eliminate the time variable. This is done through assuming 


UW = Velvn! [24] 
whence 
a0 
“Or ere wn? O [25] 
so that Eq. 22, after cancellation of the common factor e”", reads 
V?V + kn?V = 0 [26] 
in which* 
a 
Wn 
kh, = > [27] 


In Eq. 26, as in Eq. 21, the function V is a function of the space co- 
ordinates only. 

The particular form of these equations now depends upon the system 
of co-ordinates used, and the choice of co-ordinates in turn depends upon 
the geometry of the physical system to which the equations apply. For 
a rectangular geometry the ordinary Cartesian co-ordinates are used; 
for a cylindrical geometry, cylindrical co-ordinates; for a spherical geome- 
try, spherical co-ordinates; and so on. Correspondingly, these particular 
forms of the equations are known by certain names, such as Bessel’s 
equation (for cylindrical co-ordinates) or Legendre’s equation (for 
spherical co-ordinates), and the particular types of functions which 
formally satisfy the equations are largely known by names which relate 
to the particular geometry of the physical system, such as cylinder 
functions (of which the Bessel functions are the so-called first kind) 
or spherical harmonics (also known as Legendre polynomials), and so on. 
These functions are referred to as the proper functions pertaining to the 
particular physical system under consideration, the simplest of them being 
the trigonometric functions which are the proper functions for systems 
having a rectangular geometry. In a single dimension this type of geome- 
try is that of a straight line, like a stretched violin string, for example. 

In terms of some parameter, like parameter ” in sin nx and cos nx 

*The values i, ke,°++, which usually are infinite in number, are determined from condi- 


tions which the function 0 (x,9,2,t) is required to satisfy at certain physical boundaries. 
The details of this process need not be considered at the moment. 
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or, in the case of solutions to the wave equation, in terms of the so- 
called proper values kn, these functions form a set or system. In view of 
the linearity of the equations, it follows that a complete formal solution 
is given by a linear superposition of a set of these proper functions with 
different parameter values and arbitrary coefficients. Thus if ¢,(2,9,) 
represents a proper function for the parameter index 1, the solution has _ 
the form 


V = aii + dodo + azh3 + ++° [28] 


which in general is an infinite series. The coefficients a, are regarded as 
constants of integration which give the formal solution 28 the necessary 
flexibility of meeting certain boundary conditions set by the physical 
problem. 

Thus in a two-dimensional problem in static potential theory, for 
example, the potential “O(x,y) may have to become identical with a 
certain function f(x) for y equal to a particular value, say y = 0, which 
characterizes a physical boundary. Or, in a problem of wave motion, the 
function ‘O(2,y,z,4) may, for the temporal boundary ¢ = 0, for which 
(according to Eq. 24) © = V(x,y,z), have to meet some prescribed 
function. For example, a stretched membrane for which 0 represents 
the displacement of various points from an equilibrium position is given 
a particular deformation from which it is suddenly released at an instant 
which is designated as ¢ = 0, or an electrical transmission line (which 
involves a single space co-ordinate) has, at a given initial instant, dis- 
tributed upon it certain charges which give rise to a specific potential 
function versus distance along the line. These are referred to as imiizal 
or boundary distributions. Inasmuch as they may, in a physical system, 
be arbitrarily specified, the process of solution must be able to fit a series 
of the form given by Eq. 28 to a specified function of one or more of the 
independent variables. The fulfillment of these conditions may neces- 
sitate the selection of certain kinds of proper functions any of which 
formally satisfy the differential equations, such as the selection, for 
example, of a particular kind of cylinder function in the solution to a 
problem with cylindrical symmetry. 

In one dimension this problem takes the form 


f(x) = ayer (x) + Aobo(x) + azd3(x) +> > [29] 


in which f(x) and the functions ¢,(x) are known but the coefficients a, 
are to be determined so as to satisfy this equation. The problem here 
presented is determining the expansion of an arbitrary function f(x) in 
a series of specified proper functions in such a way that the resulting 
series is in general a convergent one. 

The solution to this problem is either impossible, or possible only 
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through the use of ingenious artifices, unless the system of proper func- 
tions (or a derived system formed from linear combinations of these 
functions) satisfies the so-called conditions of orthogonality, which in the 
one-dimensional case are expressed by the equations 


f bm(X)on(x) dx = 


in which a and 0 are the finite limits of the region over which the function 
f(x) is specified. The quantity 7, is ordinarily a constant and can be made 
equal to unity by the incorporation of an appropriate scale factor. This 
process is called normalization, and the resulting functions ¢,,(”) are then 
spoken of as a normalized set of orthogonal proper functions or an ortho- 
normal set. 

The use of the term “ orthogonality ” in connection with the relations 
30 is suggested by the parallelism between these conditions and those 
characterizing an orthogonal matrix.* Since the relationship between the 
present problem and that presented by a set of linear algebraic equations, 
which is thus implied, has rather more than superficial significance, 
this item is discussed in greater detail. 

Regarding the partial sum 


Sn(X) = G11 (%) + dobe(x) +--+ + Ondn (x) [31] 


corresponding to the series 29, and assuming that the function s, (x) 
is specified over the range a S x S 3, one finds that a possible procedure 
for determining the coefficients a, such that the finite polynomial in 
terms of the functions ¢;(«) approximates the given s,(x) is to divide 
the interval a to 6 into m equal subintervals whose midpoints are the 
x-values expressed by a < x1 < 4g <+++ <a <0, and then to write 
the set of m equations , 


Sn(%1) = A161 (41) + dodo(a1) ++ * + Onda (x1) 
Sn (42) = @1¢1(%2) + Gogo(H2) +++ + Andn(X2) [32] 


CY eeeeereee ees een ne 


tn form=n 


0. form xn [30] 


Sa@n) = a;91 nx) + A2h2 (Xn) eee + Anbn (Xn) 


in which the coefficients a, are regarded as unknowns. The values of 
the coefficients thus determined, when substituted into Eq. 31, yield a 
function s,(«) which has the correct values at the selected points x «+ - %» 
within the prescribed interval. At any other points, not rauch can be 
said about the degree of approximation afforded by the finite sum in 
Eq. 31. However, by taking n sufficiently large, and assuming for the 
moment that s,(«) is a smooth function, one may expect a solution which 
fulfills the requirements of a given physical problem. Indeed, as m is 


*See Ch, II, Art. 6, and Ch. ITI, Art. 4. 
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chosen larger and larger, one may expect a closer and closer approxima- 
tion to the partial sum s,(x), which is ultimately identified with the 
specified function f(x). 

Such a process of solution requires, for a finite , the inversion of the 
matrix 


$1(%1)  b2(%1) °° fae 
1%) alta) «+ nla) [33] 


which in general is a laborious task, and becomes hopeless if not impossible 
as ” is increased without limit. However, if this matrix is an orthogonal 
one, its inverse is simply given by the transpose of the matrix 33. The 
solution for any ” is then immediately written down. 

The orthogonality conditions for the matrix 33 read* 


: Xo (e)be(@) = tc aia [34] 


0 forr#¥s 


which are readily recognized as having the same form as the relations 30. 
The solutions to Eqs. 32 are then given by 


as = = bs (Xz) Sn (Xe) | [35] 


This result may be obtained through multiplying Eqs. 32 successively 
by $2(%1), $s(X2), °- + bs (xn), and adding. In view of the conditions 34, 
the sums of all the columns vanish except that of the sth, which yields 
ds, whereas the sum of the left-hand members is seen to be given by the 
right-hand side of Eq. 35. 

The equivalent solution for the coefficients in the infinite series 29 is 
obtained through multiplying this series by ¢. (x) and integrating over 
the region a < x S 6. Then if the conditions 30 hold with 7, = 1, there 
results ; 


a, = foul) de [36] 


which represents the desired solution. 
In order to derive the orthogonality conditions for the trigonometric 


“These are the orthogonality conditions for the columns only. The corresponding ones for 
the rows are not needed for the present discussion. Indeed, it is actually not necessary for 
the matrix to be orthogonal since the existence of orthogonality conditions for only its columns 
is sufficient. 
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functions it is convenient to write them in the exponential form 


1 ey ; 
‘sin nx = 3 (ei — g-I"*) [37] 


cos nx = ; (ei® 4. @In@) [38] 


whence 
. fs 1 : . res ; ays 
sin mx sin nx = — 4 (ehmtne 4 gira) _ eilm—n)e _ g-im—n)z) [39] 


; Li F . ; 
sin mx cos nx = 7. (elimina _. g-imtn)e 4 eilm—n)x _ ¢-im—n)z) [40] 
| 


and 


1; . : . 
COS MX COS NX = a@ en eg TEE ag Nh Dey PN) [41] 


Next it is observed that 


atin eik= at+Qe efka(gik2x = 1) 
ey jkz da=— = 
i eae i jk [42] 


Since e7#27 = 1 for all integer values of &, the numerator in this result is 
zero for all &-values. The denominator is not zero except for k = 0. 
Hence the value of the integral in Eq. 42 vanishes for all k-values except 
k = 0, when the right-hand side of this equation assumes an indeterminate 
form. The latter is readily evaluated if the limit k — 0 is considered to 
proceed gradually. For small values of & the exponential e”*?* may be 
replaced by a few terms of its Maclaurin series, giving 


at+le . ika ; Meo 
f eikz dx = é qd + kn + 1) ay Ver [43] 
a gk 
Thus it becomes clear that 
+20. 2x fork =0 
ke = 
Lf om de = PA for k ¥ 0 [44] 


This is an important property of the exponential function. 
By means of the expressions 39, 40, and 41 it is now readily seen that 


atln ‘ : ata form = in+¥0 
f sin ma sin nx dx = O°. foros okey [45] 


ale : 
f sin mx cos nx dx =Q for all m and [46] 


a2 +z form=232"70 
f cos mx cos nx dx = | ho ete. [47] 
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The fundamental range, so-called, which in the preceding discussion of 
this article is indicated by a S$ x S }, is in the integrals 44, 45, 46, and 47 
seen to be any region throughout which the variable x changes by an 
increment of 2x. The limits on these integrals are, therefore, arbitrary 
except that they must differ by 27 (or a multiple of 27). 

By the same process of analysis it is also readily found that 


1)! 
Ot. — for k odd 
ieee aa 
L rt 0 for & even except [48] 
aw fork =0 


in which » is any integer. In other words, if the fundamental range has a 
“ width ” of only x instead of 27, a result similar to the one expressed by 
Eq. 44 exists only for & even, and then only when the lower limit on the 
integral is an integer multiple (including zero) of z. 

If it is now observed that, for any integers m and 2, (m +n) and 
(m — n) are either both even or both odd, the result 48 together with the 
relations 39, and 41 shows that 


; T 
(v1) - = 0 
f sin max sin nx dx = +9 eS er [49] 
‘ O form #4~ +n 
and 
vin 
ele — f = +n #0 
i: cos ma cos ne dx = ig ee i [50] 
a . 0 form # +n 


in which » is any integer including zero. In other words, the trigonometric 
functions sin nx and cos mx also form orthogonal sets over a fundamental 
range which is only + units wide. However, it is to be observed that a 
relation similar to the one expressed by Eq. 46 does not hold for a range 
of this width. 

If the range 0 < « <r is divided into ” equal subranges, and the 
values %1, 2, °°: %» refer to the centers of these subranges, as shown in 
Figs. 1 and 2 for 2 = 8, then matrices corresponding to the matrix 33 
may be formed for the sine and cosine functions. 

Thus it is found that 

(2k — 1) 
= 51 
,= FS [st] 
so that if 
Ops = COSSXR [52] 
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seas | 


WF 
NS 


WWW 
RK 


WY 
BDSG 


Y Y Yi 
“iyi 
YA ' Yi? 
iyi 
YA' YA! GY 
WiYiy 
YY | GA\G 
ZA'!VZ'W 
Olx, X_ X%3 X4 X53 XX Xe 
Fic. 1. Division of sine into sub- Fic. 2. Division of cosine into sub- 
tanges appropriate to the formation ranges appropriate to the formation 
of an orthogonal matrix. of an orthogonal matrix. 
and 
Bis = SIN Sxz [53] 
then the matrices 
O11 Mig ++ * Ain 
@21 @22 °° * Aon [54] 
Ant Ang *** Ann 
and 
Bit Big Bin 
Ba: Boz +--+ Ban [5] 
Bn1 Bn2 Bnn 
satisfy the conditions 
n 
= forr = +s except 
> 2 
Akrhks = 
nas n forr=s=0 [56] 
0 forr~ +s orr= +(s =n) 


and 


n 
+ 2 forr = +s except 


BirBks ae eae epee 4(s = n) [57] 


0 forr# +s orr=s=0 


Ms 


> 
i] 
_ 
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These may be put into a form in which they more nearly simulate the 
expressions 49 and 50 through noting, with reference to Figs. 1 and 2, that 


vis 
=- 5 
Ax ba [58] 
Hence 
T 
P = forr = +s except 
>» Akraks AX = 2 [59] 
es lai x forr=s =0 
0 forr ¥ ats orr = +(s =n) 
and 
TT 
‘ + 3 forr = +s except 
rks AX = 
2, Pe cee xa forr = +(s =n) [60] 


0 forr#¥ +s orr=s=0 


These, or the expressions 56 and 57, may be verified independently. 
Thus 


ObrOles = COS Ty COS Sy = Z{cos (r — s)x, + cos (r + s)xx} [61] 
and | 
BerBre = Sin rx, sin sx, = 3{cos (r — s)a, — cos (r+ s)xz} [62] 
Now by Eq. 51 
n i= -—1)r—- | 


n 
COS (7% — S)X, = cos 
~ ( ) 7 a 2n 


(r — s)r 3 3(r — s)ar (2n — 1)(r — s)@ 


COS eee ie Oe [63] 
This expression is the same as the right-hand side of Eq. 15 of the pre- 
vious article except that (r—9 takes the place of x, and (2m — 1) 
takes the place of . Hence it is found that 

n sin (r — s)r n forr=s 
ae be) 64 
Pp Ce oS) 9 sin (r — s)r 0 forr#s except Lo4] 
—n forr = —(s=n) 
Similarly 
n sin (ry + s)r n forr = —s 
= —____ = 65 
~ eos t]he 2 sin (r+ s)r 0 forr # ~s except [65] 


2n —n forr = (s =n) 
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The last two relations, together with those stated by Eqs. 61 and 62, 
verify the conditions given by Eqs. 56 and 57. 

If the functions az, and 6,, are normalized through division by 
Vn/2, it is observed that they fulfill conditions of the form given by 
Eq. 34 except for r = s = n. This means that if the coefficients in the 
partial sum 31 are approximately evaluated by means of the formula 35, 
a correction must be applied for the coefficient a,. It should be observed 
in this connection, however, that the values of the coefficients a; obtained 
by such a method become more and more approximate the nearer the 
index k approaches n. As pointed out previously, the partial sum s,,(x) 
converges toward the desired function f(«) only as 1 is increased without 
limit. For a sufficiently large finite ”, the partial sum s,(«) may be 
regarded as an approximation to f(x) which is close enough for certain 
practical purposes, but the degree of the approximation is then very 
likely little affected if one entirely ignores the last term, or perhaps 
several terms in this vicinity of the partial sum s,(«). This item is further 
discussed in Art. 15. 

It may also be observed that the matrices 54 and 55, with the elements 
as defined by Eqs. 52 and 53, are actually not orthogonal matrices since 
the orthogonality conditions are fulfilled only for their columns, and not 
for their rows. As far as the present problem is concerned, however, 
this situation is not significant since none of the reasoning given in the 
present article is thereby affected. 


3. THE FOURIER SERIES 


The previous article indicates how a representation in the form of an 
infinite trigonometric series may be found for a given: function f(x) 
specified over a finite interval a S$ x S a + 2m. Incidentally, this desig- 
nation for the interval may be regarded as perfectly general inasmuch 
as any other, such asa <% < a + ?£, is converted to it by means of the 
change of variable x — = . 

v 


The trigonometric series, or Fourier series, as it is called, has the form 


a 
f(x) = G + a cos % + ag cos 2x + a3 cos Sa + ++: 


+ 6b; sin x + bg sin 2x + b3 sin 34 ++: 


[66] 


The necessity of having both sine and cosine terms is readily recognized. 
Writing the series 66 in the more compact form 


f(x) = ° + x a, cosnx + YO by sin nx [67] 
n= n=1 
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multiplying both sides by cos mx, and integrating over the fundamental 
range, one has . 


a} 2a ao ale 
f f(x) cos mx dx = > cos mx dx 
3 2 Ja 
at2r 2 
¥ Gn cos nx cos mx dx 
@ n=1_ : 
at2[r © 
L bp sin nx cos mx dx [68] 
a n=l 


The first integral on the right-hand side is zero except for m = 0. Exclud- 
ing this value, and assuming for the moment that the series 66 or 67 
converges uniformly over the fundamental range a S$ x S a+ 2m, so 
that the integration of the infinite sums may be carried out term by 
term, one finds 


a+ 2a bed ala 
f f(x) cos mx dx = XY an f cos nx cos mx dx 
a n=l a , 


bed a+r 
+ XL bn f sin mx cos mx dx [69] 
In view of the conditions expressed by Eqs. 46 and 47, one obtains 
1 at+2er ‘ 
om = — f f(x) cos max dx [70] 
T a 


This formula evaluates the coefficients of the cosine series, and inci- 
dentally gives the coefficient a9 correctly also, inasmuch as the ortho- 
gonality conditions 46 and 47 applied to the term-by-term integration in 
Eq. 68 yields for m = 0 


a+ 2a do ale 
[Fe ae = oP ae = ae (71) 


If Eq. 67 is multiplied by sin mx and integrated term by term over 
the fundamental range, it being recognized that the term with a» then 
yields zero even for m = 0, one obtains 


a+ 2a | fa+2e \ 
f f(x) sin mx dx = XY an cos nx sin mx dx 
a n=1 a 


iad at2er 
+ ¥ db, J sin mx sin mx dx [72] 


n=l 


Here the use of the orthogonality conditions 45 and 46 gives 


1 ater 
bm = — f(x) sin mx dx [73] 


Tva 
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which is a formula for the coefficients of the sine series in Eq. 66 or 67. 

A convenient form for the formulas 70 and 71 is obtained if the funda- 
mental range is specified as —w < x S 7, as can always be accomplished 
by a suitable definition for the independent variable. Then, » being 
written in place of m to provide a more evident consistency with the form 
of the series as stated by Eq. 67, the formulas read* 


a, == f f(e) cos ne de, forn = 0,1,2,--: [74] 
ar J—r 


and 


bn, = = ffl) sin nx dx, form = 1,2,3,--° [75] 
T —T 


It is now significant to observe that if the specified function f(x) is 
even, that is, if 


f(—«) = f@) [76] 


the integrand in the integral for b, is an odd function of «, and the 
integral between the symmetrical limits — and mr vanishes. In other 
words, all the coefficients b, are then zero, so that the function f(x) is 
represented by a cosine series alone. This should, of course, be expected 
for an even function f(*) since the cosine functions are even and the sine 
functions odd. 

On the other hand if f(x), with regard to the fundamental range, 
—mr Sx Sz, is an odd function of x, that is, if 


f(—2) = -f@) [77] 


the integrand in the integral for a, is an odd function, and this integral 

vanishes for the limits —z to 7. All the coefficients of the cosine terms 

are then zero, and the function f(x) is represented by a sine series alone. 
This state of affairs suggests that in general one may write 


f(x) = fiw) + fo(x) [78] 


in which f,(x) is an even function and fo(x) an odd one. That such a 
representation for the function f(x) should always be possible is clear 
from the fact that if x in Eq. 78 is replaced by —x, and the relations 
76 and 77 are observed with regard to the even and odd functions /, (#) 
and f(x) respectively, there results 


f(—«) = fila) — fa) [79] 


*It may not be superfluous to point out that the derivation of these formulas for the 
Fourier coefficients in no way proves that the series can actually be used to represent a periodic 
function. The conditions under which such a representation is possible and a discussion of 
the convergence of the series are given in Arts. 7, 8, 12, 13, 14. 
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Addition and subtraction of Eqs. 78 and 79 then yield 

fie) = 31 f@) +f(-2)} [80] 
and 

fala) = 31 fe) —f(-*)} [81] 


Hence the even and odd components f,(x) and f2(%) of an arbitrary 
function f(«) can always be uniquely determined either analytically 
or graphically. 

Substituting the representation 78 for f(«) into the integrals of Eqs. 
74 and 75, and taking note of the odd and even character of the functions 
f(x) and f(x”) respectively, one finds that 


1 ar 
an =— fi(x) cos nadx, forn = 0,1, 2;--- [82] 
TI —4r ‘ 
and 
bn = * fale) sin nx dx, forn = 1,2,3,--: [83] 
Tv _—e 


In other words, the coefficients of the cosine terms in the series 66 are 
determined from the even component of f(x) alone, and the coefficients 
of the sine terms are determined from the odd component alone. Inas- 
much as an arbitrary function f(x) must in general contain both even 
and odd components according to the decomposition expressed by Eq. 
78, it follows that the trigonometric series representing f(«) must contain 
both sine and cosine terms. When it is assumed, then, that the even and 
odd functions respectively are completely represented by cosine and sine 
series, it follows that the form of the assumed series representation for 
f(x) as given by Eq. 66 is also sufficient. This question, as well as the 
manner in which the trigonometric series approximates the function f(«) 
as the number of terms in its partial sum is increased, is discussed in the 
subsequent articles. 

In the meantime it may be useful to observe that the integrands in 
both the integrals 82 and 83 are even functions of x. Consequently the 
same value is obtained if the integration is extended only over the range 
zero to m and the result is multiplied by two. This gives the alternative 
formulas 


an = = [ fale) cos nx de, for” = 0,1,2,-°- [84] 
T 


and 


b, = 2 [Ao sin nx dx, forn = 1,2,3,--- [85] 
wT 
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In the discussion so far, it has been assumed that the function f(x) is 
defined only within the interval a < x Sa-+ 2z, that is, that the 
function does not necessarily exist also for values of x not in the stated 
range. For example, if f(x) represents anes initial distribution of voltage 
on a transmission line and the interval a@ S$ x S a + 2m corresponds, by 
reason of a suitable change of variable, to the length of that line, it is 
manifestly clear that an inquiry regarding the values of this function 
beyond the limits of its fundamental range has no sense. From a practical 
point of view such an inquiry does not arise since one is content with a 
solution valid over the extent of the physical system. 

It is, on the other hand, sensible to inquire, for the moment out of 
pure curiosity perhaps, what kind of function the trigonometric series 
represents when all restrictions on the independent variable are removed, 
in other words, if x is allowed to take on all values from — to «. In 
view of the periodic character of the trigonometric functions, the answer 
to this question is obvious. The trigonometric series represents a periodic 
function with the period 2x, such that the behavior of the function 
throughout any one period matches that of the function f(«) over its 
fundamental range. 

It is thus clear that the trigonometric series in Eq. 66 is capable of 
representing an arbitrary periodic function over the entire range of its 
independent variable from minus to plus infinity. Such functions occur 
quite frequently in engineering analysis, the independent variable 
usually being the time ¢. The periodicity of the function is expressed by 
the relation 


fé+ kr) =f@ [86] 


in which 7 is the period (sometimes referred to as the fundamental period) 
-and k& is any positive or negative integer. The reciprocal of +r is the 
fundamental frequency 


1 
i= - [87] 


and 


o = Inf =— [88] 


is the fundamental angular frequency or radian frequency. 
The Fourier series representation for the periodic function f(#) is 
written 


f(t) eh a 6os wk 4 ay coe Quah ee 
: [89] 
+ by sin wt + bo sin 2wt +-+- 
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in which, according to Eqs. 74 and 75, 

an = a ZO cos mwt dt, forn = 0,1,2,--- [90] 
and 


x | w 
.b, = “f ot sin nwt dt, forn = 1,2,3,--- [91] 
TU—« la 


4. THE PHASE ANGLES OF THE HARMONIC COMPONENTS 


In view of the trigonometric identities 


Cn, 008 (Mut + dn) = cn sin (nat + 5 as én) 
= Cn COS bn COS Nut — Cz Sind, sin nwt [92] 


it is possible to combine the sine and cosine series in Eqs. 66 or 89 through 
letting 
an = Cy, COS on 


b, = —Cy SiN on [93] 
whence 
Ca = Van? + b,2 [94] 
én = —tan (*) 
Dn, 


The Fourier series can then be written as a sum of sine or cosine terms 
alone, for example, as 


f®= $ + ¢1 cos (wt + $1) + ¢g cos (2ut +2) +--- [95] 


The term having the fundamental angular frequency is commonly 
called the fundamental component of the periodic function f(#), and the 
remaining terms,- whose frequencies are integer multiples of the funda- 
mental, are referred to as harmonics. The coefficients c, and the angles $n, 
which are determined by Eqs. 94 together with Eqs. 90 and 91, are known 
as the harmonic amplitudes and phase angles. 

Because of the periodic nature of the function f(t) it is usually possible 
to select the origin for the independent variable ¢ at any convenient point. 
In subsequent manipulations it may, nevertheless, be desirable to shift 
the origin to a new location. Thus, if the variable ¢ in f(¢) is replaced by 
(¢ — f), the effect is to shift the origin back by an amount equal to f& 
(this shift amounts to retarding the sequence of values of the function 


454 FOURIER SERIES AND INTEGRALS (Ch. Vit 


by fo). Graphically, this change of variable corresponds to shifting the plot 
of the function forward by an amount fy. A typical harmonic component 
becomes 


Cn CoS [new(t — to) + dn] = cn cos [nwt + (on — nto) ] [96] 
This result may evidently be interpreted as a change in the harmonic 


phase angle to the new value 


Qrnt 
¢'n = bn — Nato = on — — [97] 


It is significant to note here that each harmonic phase angle is changed 
by an increment proportional to the order 1 of the corresponding har- 
monic component. Such a proportionate change in the harmonic phase 
angles, therefore, leaves the resultant form of the function f(¢) unchanged 
except for a translation of the function as a whole. 


5, EVEN AND ODD HARMONICS 


It frequently occurs that the given perodic function satisfies the 
condition 


i(t+2)=-F0 [98] 


which means that the sequence of values of the function throughout 
any half period are the negatives of the values encountered throughout 
the preceding or succeeding half period. In order to observe the effect of 
this condition upon the form of the resulting Fourier series, one may write 
the expressions 90 and 91 for the coefficients in the form 


ay, = f ars lf () cos ee +f (: - 4) cos nw (: - *] dit [99] 
Bn ° f nt [fo sin not +f ¢ = | sin nw ¢ = ] dt [100] 


These are readily recognized as being equivalent to Eqs. 90 and 91 
through noting, for example, that 


ff0 cos nwt di = LH a 4) Cos Nw ¢ — a) dt [101] 


because the second integral is obtained from the first when one makes the 
substitution ¢ > 4 — 7/2 and appropriately changes the limits of integra- 


and 
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tion. Then, observing that nw7/2 = nz, and hence that 


COS Nw (« ~ 4) = COS Mr COS Nut 
r [102] 
sin nw (: - 4) = cos Mr sin nwt 


one finds that the condition 98 substituted into Eqs. 99 and 100 is seen 
to yield 


ae fos 
an = 2 (1 — cose) ff) cos met dt [103] 
and 
(3) feo 5 
by = © (1 — 005 me) f f(t) sin nat di [104] 


The factor (1 — cos mm) is zero for all even integers of 2 and equal to 
2 for all odd integer values of ». Hence it is clear that when the periodic 
function has the property expressed by Eq. 98, its Fourier series contains 
odd harmonics only. The average value or constant component 49/2 is 
evidently zero also. The coefficients of the odd harmonics are then given 


by 
20 frie 
1 a i FS (é) cos nwt dt [105] 
and 
2w x foo P 
by. = = f F@ sin not dt [106] 


These considerations may suggest the question of what the results are 
when the function f(#) satisfies a relation complementary to that ex- 
pressed by Eq. 98, namely, when 


ia d)= +70 [107] 


Although recognizing that the factor (1 — cos mm) in Eqs. 103 and 104 is 
changed to (1 + cos mm) gives the answer a moment’s reflection reveals 
that the condition 107 merely states that f(t) has the period r/2 instead 
of r. If rc is retained as the fundamental period, the result is that the 
Fourier series contains only even harmonics, which is just another way 
of saying that the period of f(t) is actually half as large. 

On the basis of these thoughts it appears that any periodic function 
may be assumed to consist of two components, of which one satisfies the 
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condition 98 and the other has twice the fundamental frequency. This 
decomposition may be indicated by 


{O=hOt+inO [108] 
in which 
fi (: = ) = fr(t) [109] 
and 
fu (« ae 4) = —fir(¢) [110] 
It then follows that 
s(t +3) = fit) — fir) [111] * 
and hence, adding and subtracting Eqs. 108 and 111, that 
ORE OR] (29 | ET 
fa = 3170 -s(+#3)| [113] 


This decomposition is similar in analytic form to the decomposition 
into even and odd components as stated by Eqs. 80 and 81, but should 
not be confused with the latter. The 
decomposition into even and odd 
components according to Eqs. 80 and 
81 is a decomposition of the given 
function into components which 
are respectively symmetrical and 
antisymmetrical about the ordinates 
at x = 0 or# = 0, and hence yields 
a decomposition of the Fourier series 
Fic. 3.. Decomposition of a saw-tooth into its component cosine and sine 

wave into even and odd harmonics. series, each of which in general con- 

tains both even and odd harmonics, 
and hence can be further decomposed according to the relations 112 and 
113. 
_ The component functions given by Eqs. 112 and 113, on the other 
hand, may individually be either even or odd but in general are neither, 
and hence can also be further decomposed. The decomposition indicated 
by Eq. 108 is such that the first component contributes the even har- 
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monic components to the resulting Fourier series, and the second con- 
tributes only the odd harmonics. An interesting example of this decom- 
position is illustrated for the saw-tooth wave shown in part (a) of Fig. 3. 
Here the rectangular wave (shown dotted) represents the odd harmonic 
components, and the even harmonics are due to the saw-tooth wave of 
double frequency, part (b), which remains after the rectangular com- 
ponent is subtracted from the original saw-tooth wave. 


6. ALTERNATIVE FOURIER EXPANSIONS FOR A FUNCTION DEFINED 
OVER A FINITE RANGE 


The preceding articles show that there is an infinite variety of ways to 
establish a Fourier series representation for a function which is defined 
over a finite region, if the sole object is to obtain a trigonometric series 
which yields the correct values of the. stated function over this finite 
range only. One way is to consider the defining range of the given function 
as the fundamental period, as is done in Art. 3. However, since this range 
may alternatively be considered as only a part of the fundamental 
period, and since the definition of the given function over the remainder 
of this period as well as the extent of the period are entirely arbitrary, 
it is clear that any number pf trigonometric series representations may be 
found. All of them yield the same values over that portion of the period 
which corresponds to the original defining range, although they may show 
a variety of behaviors throughout the remainder of each period. 

It should be observed that if the determination of a Fourier expansion 
appears as part of the process of solving a boundary value problem, such 
a variety of alternative possible procedures does 
not exist, because the conditions of the problem 
permit the choice of only one set of proper 
functions in terms of which the expansion may 
be made. There are, however, many other ways 
in which Fourier expansions enter into engi- 
neering analysis, and in most of them the ob- Fy. 4. A function de- 
ject of using the series representation is solely fined over a limited range. 
to get an approximating function in the form ofa 
trigonometric series for some given function whose behavior is specified 
over a finite interval. Inasmuch as the engineer must, for practical 
reasons, limit the expansion to a finite number of terms, and the computa- 
tional labor as well as other economic factors dictates that this finite 
number shall be as small as possible, the freedom of choice mentioned 
in the preceding paragraph must be recognized as an important con- 
sideration in the selection of a suitable process of analysis. 

As a simple illustration of how a variety of series representations may 
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be obtained for a given function defined over a finite range, it is interest- 
ing to consider the function illustrated in Fig. 4 for which the defining 
range is the region 0 S$ x S x. Parts (a) to (e) of Fig. 5 show several 


sines and cosines 


~s. 01 Aas! “4 
odd harmonics only 


(a) 
sines only 


xX o7 
Le 


(c) 


7°10 xy MAL Pod 
odd harmonics only 
(d) 


- 
orn? wee, 
ia ~ 
of 


ao ~, 


~ 
“Ss 


* 
good approximation with a 
single sine term 


e) 


Fic. 5. -Various possible periodic continuations of the function of Fig. 4. 


ways in which this function may be assumed to be continued beyond the 
defining range so as to yield a periodic function. In each case the behavior 
over the defining range is the same as that given in Fig. 4, but the series 
representations for the individual cases are nevertheless quite different, 
as indicated on each skétch. 
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It is also significant that the rate of convergence of the resulting Fourier 
series may be quite different for the different forms of periodic functions. 
As is also pointed out in a subsequent article, the convergence is in general 
more rapid for a smooth function than it is for one which varies er- 
ratically or has large first or higher derivatives. For example, the series 
for a function which has discontinuities, like those sketched in parts 
(a) to (c) of Fig. 5, in general converges rather slowly, so that a large 
number of terms must be calculated in order to establish a fairly good 
representation for the function. If the latter is continuous, as it is in 
parts (d) and (e) of the figure, the convergence is considerably more 
rapid. The function shown in part (e) has a more rapidly converging 
series than the function in part (d) whose first derivative has discon- 
tinuities. In fact, the function in part (e) is approximated very well by a 
single sine term. 


7. THE FOURIER SERIES AS A SPECIAL FORM OF THE LAURENT 
EXPANSION; THE COMPLEX FOURIER SERIES 


The Fourier series may alternatively be obtained from the Laurent 
expansion discussed in Art. 12 of Ch. VI. According to Eqs. 163 and 166 
of that chapter, the expansion has the form 


f@) = =. ain(Z — fo) [114] 


in which the coefficients are given by the 


formula 
1g fee 
aot ras (¢ — 20)" [115] 


The region of uniform convergence lies 
between the circles r; and 72 indicated in 
Fig. 6, for which it is assumed that 7; < 1 : ‘ 
<7». The expansion 114 then converges Fis. ©. Region of uniform 
: : <qves convergence of a Laurent series 
uniformly for all points on the unit circle fron which, the Fourier: series 
drawn with zo as a center. As discussed in may be obtained by an appro- 
the previous chapter, this statement as- priate change of variable. 
sumes, of course, that the function is reg- 
ular and continuous at all points within the annular region enclosed 
by the circles with radii r; and rg. No restriction is implied by the 
assumption that 7, < 1 <2, since this condition may in any case be 
obtained by means of an obvious change of variable, should it not be met 
‘in the first place. 
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If the point z in the expansion 114 is assumed to refer to any point on 
the unit circle, and the latter is also chosen as the path of integration S 
in the formula 115, one may write 


z— % = e¢ [116] 
and 
[—%24= ev [117] 
whence 
dg me hud 
=jd 118 
o — 20 os U8) 


The expansion 114 then takes the form 
f@= zx ane”? = f(¢) [119] 


in which, according to Eqs. 115, 117, and 118, 


_ 1 pf dy 1 ny 
~ Ong Ja eind =x. fern dy . [120] 


an 


Since the point z is restricted to lie on a circle about zo, f(z) evidently 
reduces to a function of the real variable ¢. In order to conform with 
more familiar conventions, the symbol ¢ may be replaced by x and the 
variable of integration y by ¢, so that Eqs. 119 and 120 take the form 


f(@) = EL anei™® [121] 
and | 
1 a+ 2a : 
On = fee dé [122] 
2a Ja 


Although the function f(«) is real, it should be observed that the 
coefficients a, are complex. The series representation for the function 
f(x), as expressed by Eq. 121, is known as the complex Fourier series. 
It may readily be shown to be entirely equivalent to the real form in 
terms of sines and cosines. The first step in this regard is the observation 
that, according to Eq. 122, the coefficient a, is replaced by its conjugate 
value if 2 is replaced by —n, that is, : 


This result is clear from the fact that changing the sign of m in Eq. 158 
is equivalent to changing the sign of j. The series 121 may be written 
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out in the form : 
fle) = aot { oe ben te tos 


aye + ape i + a_3e? a a 


[124] 


from which it is seen that the terms appear in pairs of conjugates. A 
typical conjugate pair reads 


ane + a_pe7in® [125] 
Now if one writes 
_ an ~ fbn 
a= 2 [126] 
then, according to Eq. 123, 
aes eee [127] 


The pair of terms given in Eq. 125 then becomes 
ane + ape i"* = dy cos nx + by, sin nx [128] 


whereas the separation of Eq. 122 into its real and imaginary parts yields 


1 a+2e 
On = = F(€) cos né dé [129] 
and , 
. 1 a+ 2x 
by = ~ f f(€) sin né dé [130] 
TdJa : ; 
ao 


Since by = 0, and hence ap = Eq. 128 shows that the series 124 is 


2 , 
equivalent to 


f(x) = B+ a; cos # + ay Cos 2a + +++ 
+ by sin x + bo sin 2x -+--:- [131] 


which is the familiar form for the Fourier series in terms of its sine and 
cosine components. The formulas 129 and 130 are seen to be identical 
with those given by Eqs. 70 and 73. The equivalence between the com- 
plex form 121 and the ordinary form for the Fourier series is thus estab- 
lished, and the formula 115 for the coefficients of the Laurent expansion 
is seen to contain the formulas for the Fourier coefficients as a special case. 

The complex form for the Fourier series is more convenient than the 
ordinary form for many manipulations because of its relative compact- 
ness as contrasted with the sine and cosine form, and also because of the 
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greater ease with which the exponential function may be manipulated 
with regard to various algebraic as well as differential and integral oper- 
ations. In this connection it may be observed, incidentally, that the 
complex form requires only the one formula 122 for the evaluation of 
its coefficients as contrasted to the two formulas, 129 and 130, which are 
ordinarily needed. 

The relation between the complex coefficient , and the real coefficients 
a, and b, is further illustrated by the sketch in Fig. 7 which shows a 
pair of conjugate complex coefficients 
am and a_n. These may be regarded 
as the conjugate complex terms 

ane? and a_n»eé'”* for x = 0. As the 
erable x increases from the value 
zero, the vectors ‘which these terms 
represent rotate in opposite directions 
at an angular rate of » radians per 
unit of x. They, therefore, remain 
conjugates for all values of x, so that 
Fic. 7... Aqiair of conjugate complex their vector sum becomes a real, 
coefficients of the complex Fourier simple harmonic function of x. For 

series. ' x = wt this pair of conjugate terms 

is thus seen to represent a simple har- 

monic function of the ‘time ¢ with an angular frequency of mw radians 
per second. 

Denoting the angle of the complex coefficient a, by $n, as indicated 
in Fig. 7, that is, writing 


= Jon| ei», ain = lan| eae [132] 
one finds that the expression for this simple harmonic function is given by 
ane + a_ne in = Jan| (e2* ton) + einz+$,)) 


2|an| cos (mx + dn) [133] 


complex plane 
! 


Equations 126 and 132 give 
n — Jon = 2|an| cos dn +J2\|an| sin on [134] 


or separating reals and imaginaries this yields 
: Gn = 2|an| COS dn 

: [1335] 

bn = —2lan| sin dn 


Comparison with Eqs. 93 and 95, shows that the resulting harmonic 
amplitude c, is equal to éwice the magnitude of the complex Fourier 
coefficient a,, whereas the harmonic phase angle is the angle ¢, of the 
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complex coefficient a,. These results are also evident from inspection 
of Fig. 7. 

The complex form for the Fourier series is thus seen to contain the 
phase angles of its various harmonic components by virtue of the complex 
character of the complex coefficients. The formula 122 for these coeffi- 
cients yields these harmonic phase angles as well as the corresponding 
amplitudes 2|a,,|, and the complex form of the series, as expressed by 
Eq. 121, contains both the amplitudes and the phase angles although the 
latter do not appear explicitly. This fact, namely that the harmonic 
phase angles are implicitly contained in the complex form 121, and hence 
need not be explicitly written down during any manipulations which 
may be carried out with the series, is one of its greatest labor-saving 
virtues when the Fourier series is used in various analytic formulations. 
This advantage becomes particularly apparent from the discussions given 
in Art. 9. 

Inasmuch as the basis for the Laurent expansion requires that the 
function f(z) be regular and continuous throughout the annular region 
enclosed by the circle with radii 7; and 72 in Fig. 6, one should expect 
that the validity of the Fourier series would be similarly restricted. 
Although this restriction, in general, holds if one expects uniform con- 
vergence of the series for all values of its independent variable, it is found 
nevertheless that the conditions for the possibility of obtaining a Fourier 
series representation for a given function are somewhat less confining if 
certain reservations are accepted regarding the convergence of the result- 
ing series. 

The conditions, known as the Dirichlet conditions, under which a 
Fourier series representation for a given function is possible, state that 
throughout the fundamental range for which the function is defined, it 
shall possess a finite number of maxima and minima, and in spite of having 


a finite number of discontinuities and points where the function becomes . 


infinite, it shall possess an absolutely convergent integral; that is,* 


[Fela [136] 


shall be finite. If, according to the latter part of this statement, the func- 
tion becomes infinite at some point, this infinity shall be integrable, for 
example, like a logarithmic infinity. (It should be recalled that the area 


b 
*The absolute convergence is required only if the integral f f(x) dx becomes improper. 


Actually, the conditions set down by Dirichlet are more stringent than those just stated, and 
do not permit the function f(x) to become unbounded. The conditions stated here are, never- 
theless, designated by many writers as the Dirichlet conditions, and inasmuch as it is a con- 
venient way of referring to them, this somewhat inaccurate practice is used here also. 
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under a logarithmic infinity is finite.) The series converges uniformly for 
all values of the independent variable except those corresponding to 
points of discontinuity, and at the points corresponding to infinite values 
for the function the series can, of course, no longer converge. 

At a point of discontinuity, which may be denoted by x = %, the 
Fourier series yields the average of the two values of the function imme- 
diately adjacent to this discontinuity, that is, the series yields the value 


The reason for this property of the series is discussed in Arts. 12 and 13. 
Together with a number of other interesting items, it is illustrated by 
some of the examples of the next article. 


h(x) 


Fic. 8. The saw-tooth wave is basic in the description of disccntinuities. 


8. SEVERAL ILLUSTRATIVE EXAMPLES; A CRITERION REGARDING 
THE RATE OF CONVERGENCE 


The first example to be considered is shown in Fig. 8. This function 
h(x) has discontinuities of unit magnitude which occur at the origin and 
at integer multiples of 27. It is similar to the saw-tooth wave shown in 
Fig. 3 except that it is turned over and raised so as to lie upon the x-axis. 
This function may be defined by the statements 


1 
joe ee for -7§ <4 <P 

2 2r 

hoes a3 [138] 
h(x) == +" for 0<a2<7 

2 2a 


It is observed, therefore, that except for the constant component 4, the 
function is odd. Hence its Fourier series is given by 14 plus a sine series 
whose coefficients are evaluated by the formula 85. These coefficients are, 
therefore, given by 


. ‘41 ir : 
=f (x — x) sin nx dx [139] 
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The integration yields 


1 
so that the resulting Fourier series becomes 
1. 1fsinx , sin2x | sin 3x 
na) = 5 + =| + 9 as 3 too] [141] 
At the points x = 0, 2x, 4x, --+ —2m, —4m, +++ etc., the value of the 


series is 14, which is the arithmetic mean between the values of h(x) 
immediately adjacent to these points of discontinuity as stated by the 
expression 137. For the immediate vicinity of these points the series 
cannot be said to converge uniformly because it yields different values, 
depending upon the direction from which these points are approached, 
and hence the partial sums do not converge toward a definite limit in this 
vicinity as 7 is indefinitely increased. 

In order to investigate the convergence for other values of x one may 
regard the partial sum 


sa(t) = re SE ot aad [142] 
and form 
 [sn4e(@) — Sa()| 
- L sin wate sin oe Beane sin ams 143] 
Letting 
on = Sinx + sin 2x +--+- + sin nx [144] 
one may write the expression 143 
[see (@) — Sna()| 
-* rer ae ees ee ee ae a! 


The right-hand side of this equation may be written alternatively in the 
form ; 
1 


wv 


ae. (4 - a oe C5- 1 Ve 
n+i Neti ata) OF Nt 2 2 +3) 


A 1 Ont+k 
+ onst-s(——5 me | - 15) + 2 [146] 
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Now, according to Eq. 12, 


cos 5 — cos (2m + 1)5 
2 = [147] 


. 2 
2sin 5 


Although, for various values of ” and x, this sum o, may have a variety 
of positive or negative values (in particular for x = 0 or a multiple of + 
it has the value zero), it is clear that a finite positive quantity S may be 
found which the absolute value of o, cannot exceed for any x and any n, 
no matter how large the value of the latter may be chosen. In other 
words, it is possible to specify that 


lon| <.S for any x or values [148] 


in which S is finite. 

Observing that the coefficients of the partial sums on41, Onta,*** On+k 
in the expression 146 are positive, and that o, may have a numerically 
negative value while the remaining partial sums are at the same time all 
positive, one sees that, even in these most unfavorable circurastances, this 
expression nevertheless cannot have a value in excess of 


SYiae Gollinat al ll Ay) eae | Oe 
rinti nti nt+2 n4+2 n+3 n+k- ntk| w(n+1) 
[149] 
Hence it is established that 
2S 
|snpu(%) — Sn(x)| < GED for all w and k [150] 
Consequently the statement that 
|Sntn(%) — Sn(x)| <€ independently of x . [151] 
” for all values of & and all values x > N may be justified through choosing 
2S 
eG) <e€ [152] 
or 
2S 
(a+1)>— [153] 
TE 
which means 
=e 1 [154] 
TE 


According to Cauchy’s principle of convergence (Eq. 94, Art. 9, Ch. VI), 
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therefore, the Fourier series 141 for the function illustrated in Fig. 8 is 
seen to converge uniformly over any range of x-values which excludes 
the points of discontinuity. 

The results obtained for this simple example enable one to draw 
conclusions regarding the convergence of Fourier series for more arbitrary 
functions possessing a finite number of discontinuities. As a first step 
toward evolving such a generalization, a function f(«) may be assumed 
to have a single discontinuity equal to the value 6 at some point x = x, 
in the fundamental range a S x S a + 2z. In other words, the function 
f(x) is continuous throughout this range except for a sudden jump 6 
(positive or negative) in its value at the one point x = ~%,. 

In terms of the function k(x) of Fig. 8, one may form a function 
8 - h(x — x,) which is a saw-tooth wave having a jump equal to 6 at the 
point « = x, in the fundamental range. If this function is subtracted 
from f(«), the resulting function 


F(x) = f(x) — dh(% — a) [155] 


must be continuous throughout the fundamental range a S$ + Sa + 2r. 

With this result tucked away in one’s mind for future reference, at- 
tention is for the moment turned toward the formula 122 for the complex 
Fourier coefficients, which is repeated here for the convenience of the 
reader 


Qn = = f 7 seer int dé ; [156] 


Here one may apply the principle of integration by parts, for which the 
well-known formula reads 


fud = - vdu [157] 


Letting : . 

u=f) and d=e "dé [158] 

one has 

e ant 
jn 

in which the superscript (1) on the function f(é) indicates that its first 

derivative is meant. Subsequently, the superscripts (2), (3), etc., are 


used to denote the second and higher order derivatives. 
Substitution into the formula 157 yields for the coefficient a» of Eq. 156 


—jntNa+2n +22 : 
On = - fee + ad f (E)e nt dt [160] 


du = f (&) dé and v= [159] 
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However, 


fee" at2e 7 ee - _ 
He = F(a + 2) -F@)}=0 [160 


2Qary a 


By repeatedly applying ae same tie one may obtain the formula 
ky en int d. : 162 


which, of course, is a proper integral only so Jong as the kth derivative 
f (€) remains finite, although it may be discontinuous. 

It may now be supposed that the given function f(*) and all its suc- 
cessive derivatives f(x), f® (x), --- f@-(*) are continuous, but 
that the Ath derivative f(x) possesses the discontinuity 6 at x = x,. 
Then, according to the argument leading to Eq.°155, the function 


FO (x) = f (e) — bh — a) [163] 


is still continuous, and hence for it the procedure leading to the integral 
162 can be continued at least one step further. Hence the Fourier co- 
efficients a, for the function F“(«) must decrease in magnitude for 
large values of 1 at least as rapidly as the ratio 1/(7)**!. Inasmuch as the 
illustrative example at the beginning of this article shows, however, that 
_ the Fourier coefficients for the function h(x — x,) decrease with in- 
creasingly large ” only as fast as the ratio 1/n, it follows that those for 
the function f™ (x) can do no better although they are certain to do as 
well. These coefficients are given by 


wef mera Uo 


and hence the coefficients for the function f(x) are, according to Eq. 162, 
expressible as 


an = 


an 
(jn)* 
One may, therefore, conclude that for a function f(x) which, together 
with all its successive derivatives up to that of the kth order,.is continuous 
throughout the fundamental range, and for which, therefore, the kth 
derivative is the first one possessing a discontinuity within this range, 
the Fourier coefficients are found to decrease in magnitude for large 
values of ” as rapidly as the ratio 1/n**}, 
' Furthermore, one may conclude that, inasmuch as the Fourier series 
for h(x — x,) converges uniformly over any region whose boundaries 
exclude the points x, +: an integer number of 272’s, the series for the 
function f™ (x) also converges uniformly over the same ranges. In the 


[165] 


A, = 
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immediate vicinity of a point of discontinuity this series no longer con- 
verges uniformly, but its partial sums s,,“” (x) in absolute value evidently 
remain smaller than some finite upper bound like the value S for the 
partial sum 144 or its closed form 147. Recognizing that the Fourier 
coefficients for the function f“- («) are those for the function f™ (x) 
multiplied by 1/jn, and following a procedure identical to that used in 
the investigation of the convergence of (x), one is led to the conclusion 
that the series for f*~” (x) converges uniformly even over a range which 
includes the discontinuity of f(x). The series for the derivatives of 
lower order, and so on down to that for the function f(x), then certainly 
do likewise. — 

An important result of this line of thought is the fact that if a given 
function f(x) is itself continuous but possesses a discontinuity in its first 
derivative at some point x,, the Fourier series for f(x) still converges 
uniformly at the point x = x,. It may, therefore, unquestionably be 
integrated term by term, and it may also be differentiated term by term, 
although the point of discontinuity x, must then be excluded from the 
region of uniform convergence. 

Each time the Fourier series is integrated term by term, an additional 
factor is introduced into the denominator of the expression for an, 
and each time it is differentiated, a factor of ” is canceled out of the 
denominator of this expression. It is readily seen, therefore, that integra- 
tion improves the rate of convergence of the Fourier series, whereas 
differentiation makes the series converge more slowly. This fact may have 
been expected in the first place since integrating a function makes it 
smoother whereas differentiating it accentuates any existing irregularities. 

It is a simple matter to extend these conclusions to include functions 
having a larger number of discontinuities, as long as this number remains 
finite. For example, if the function f(x) has the discontinuities 61, 52, 
. ++ 8, at the points 21, #2, -+ + x» within the fundamental range, the func- 
tion 


F(x) = f(z) — Eade a) [166] 


is continuous throughout this range, whereas the Fourier series for that 
part of f(x) represented by 


& bike — x) [167] 


is given by the sum of v series, each of which has the same general form 


as that for A(x). The Fourier series for f(x), therefore, converges uniformly 


over ranges whosé boundaries exclude the finite number of discontinuities, 
and the coefficients decrease in magnitude with large values of as fast 
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as and no faster than the ratio 1/n. The series may be integrated term 
by term, yielding a series which converges uniformly throughout the 
fundamental range, but it cannot be differentiated term by term since 
the resulting series then no longer converges at all. 


te 
~ Pe2h (x) PS 


Fic. 9. Two saw-tooth waves add up to yield a square wave. 


The second example to be considered is the rectangular wave with 
identical positive and negative half cycles of unit magnitude as shown 
in Fig. 9. This function has a discontinuity equal to 2 at the beginning of 
its fundamental range 0 S$ x S 27, and another equal to —2 at the 
center of this range. The saw-tooth functions representing these two 
discontinuities are, therefore, given by 


2h(x) and —2h(x — x) [168] 


After subtracting these from the given function in this example, one 
finds that there is nothing left, that is, 


f(x) — 2h(x) + 2h(e — xr) = 0 [169] 


Hence, according to Eq. 141, the Fourier series is given by 


2fsinx sin 2x 
fe) =2[= 5 stage +] 


|= eo + mere +] 170] 
With 
sin n(a — 3) = (—-1)" sin ux [171] . 


this result yields for the Fourier series of the square wave shown in 
Fig. 9 


sinz | sin 3x ney] [172] 


4 
je) = 2/4 


As one should expect from the fact that f(x) is odd, the representation 
is given by a sine series alone, and since the function satisfies the condi- 
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tion 98, only the odd harmonics are present. When the origin is shifted to 
the point « = 7/2, the function becomes an even one and the series is 
converted into a cosine series. This shift is accomplished by replacing 
x in Eq. 172 by x + 7/2, thus 


T 4[cosx cos3x . cos5x 


in which it is to be observed that the algebraic signs alternate. It may 
also be noted from Eq. 172 that, at the points of discontinuity of the 
function, the series yields the value zero which again is the algebraic mean 
between the immediately adjacent values of f(x). 


Fic. 10. A triangular wave which results when the rectangular wave of Fig. 9 is 
integrated. 


Integrating the series 172 term by term, one obtains the Fourier series 
representation for the triangular wave shown in Fig. 10 which is recog- 
nized as the result of integrating the square wave of Fig. 9. Thus, for 
the function of Fig. 10, 


ie x , cos 3a, cos 5x% oar ] [174] 


A lean ae 9 25 


A point which may be a bit puzzling here is the fact that no constant 
term appears in the expression 174, whereas if one mentally visualizes 
the function which represents the area under the square wave of Fig. 9, 
starting the integration at x = 0, this function evidently turns out to be 
a triangular wave lying upon the «-axis, that is, having the form of the 
wave shown in Fig. 10 but with an additive constant component equal 
to x/2. The reason that this constant component is missing from the. 
representation 174 is, of course, the fact that this result is the indefinite. 
integral of the series 172 and not the integral from zero to some variable. 
point x. Incidentally, the definite integral from +/2 to any variable 
point x has no constant component, as is readily recognized from inspec- 
tion of the Fig. 9. At all events one must recognize that the indefinite 
integral can yield only what is customarily referred to as the alternating 
component of the resulting function, and this in the present example is 
the triangular wave of Fig. 10. 
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It should be particularly noted that the series 174 converges consider- 
ably more rapidly than the series 172 or 173. Thus the coefficients in 
the series 174 vary as 1/n? whereas those in the series 172 vary only 
as 1/n. This observation agrees with what should be expected from the 
preceding discussion inasmuch as the triangular-wave function is con- 
tinuous whereas its first derivative, the square-wave function, possesses 
discontinuities. 


9. THE FourIeER SPECTRUM 


The interpretations given in the present article are most commonly 
used in connection with functions which are periodic with Tespect to 
the time ¢. The essential manipulations are, moreover, most effectively 
carried out in terms of the complex form of the Fourier series, which is 
expressed by Eq. 121 together with the formula 122. For « = wt these 
expressions are usually written 


f@ = = aera! [175] 
with , | 
a= f : Feri" at [176] 

TJ —ax jw 


Here it is useful to regard the general coefficient a, as a function of 
the variable mw, which is the angular frequency of the harmonic com- 
ponent of order 2. Since » assumes only integer values, the variable nw 
is a discontinuous one, and the function a,, which may alternatively be 
denoted as a(nw), has values only for discrete values of its independent 
variable. Nevertheless, it is useful to regard a(nw) as a function rather 
than merely as denoting the values of the Fourier coefficients, since this 
view leads to an interesting and useful interpretation of the relations 
175 and 176. 

The variable ¢ is regarded as the independent variable in a certain 
region designated as the time domain, whereas the variable uw represents 
an independent variable in a corresponding region known as the frequency 
domain. The function which is given as f(¢) represents a specification of 
some desired function in the time domain, and the corresponding func- 
tion a(m«), or an, found from the integral Eq. 176, is regarded as the 
specification of that same function in the frequency domain. In other 
words, the function a (nw) is in every respect just as complete.and specific 
a Tepresentation of the desired function as is f(t), the only difference 
being that a(nw) represents that function in a different dornain, namely 
that domain in which mw instead of ¢ is the independent variable. 
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This view that the function a(mw) is entirely and uniquely the equiva- 
lent of the function f(¢) is indisputably tenable since the relation 175 
uniquely converts the function a(nw) into the function f(#), whereas the 
relation 176 does the reverse. In other words, these two expressions are 
a pair of mutually inverse relations in the sense that one of them undoes 
what the other does. This circumstance may be placed even more clearly 
in evidence through substituting the expression 176 for a, into 175, 
which operation incidentally requires writing for the variable of integra- 
tion ¢ in Eq. 176 some other symbol % so as to avoid confusion with the 
independent variable ¢ in the expression 175, thus 


Lr) lo . 
$0 = ES [power du [17] 


By means of this expression, in which the variables ¢ and u both refer 
to the time domain, the given function f(¢) is expressed in terms of itself. 

The pair of relations 175 and 176 may, in the light of this interpreta- 
tion, be regarded as transformations which accompany the change of 
variable from ¢ to mw or vice versa. For this reason the function a(nw) is 
sometimes spoken of as the Fourier transform of f(¢), and the latter as the 
inverse transform of a(nw). In their present form the transformations 175 
and 176 are restricted by the condition that the function f(¢) must be a 
periodic function defined over the entire time domain —0 <i < o, 
The removal of this restriction and the accompanying modification in the 
Fourier transforms are discussed in Art. 19. 

Just as the function f(f) may be represented graphically by being 
plotted in the time domain, so the equivalent function a(mw) may be 
plotted in the frequency domain. The latter domain is more commonly 
known as the frequency spectrum and the plot of a(mw) as the Spectrum 
representation of the function f(t). 

Inasmuch as the function a(mw) is in general complex, it is necessary 
to make two plots in order to represent it completely. The ones usually 
chosen are the magnitude lern| and the angle ¢,, although the real and 
imaginary parts could also be chosen. Plots of |an| and ¢, are called 
respectively the amplitude and phase spectra of the given function f(é). 
Since mw assumes only discrete values, these plots are not in the form of 
continuous curves but consist merely of a series of vertical lines repre- 
senting the ordinates of the functions |a,| and ¢, corresponding to integer 
values of ”. For this reason they are referred to as line spectra. 

The square-wave function of Fig. 9, for example, yields, according to 
the formula 176, 


F ee fees. —jnet ee —~jnwt 
as A te d+ [etme at [178] 
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or, the change of variable ¢—> t — /w being made in the first integral, 
@ e ae foo : 
an = (1 — or) [einer at [179] 
2x 10 


Integrating and substituting limits give 
= hes e™™)(1—e"™) — (1 — cos mr) 


180 
Qrnj ny ite) 


n 


Hence 
0 for ” even 
an = 
‘ cs for » odd [181] 
jam 


Substituting this result, into the sum 175 and combining the conjugate 
terms yield the series 


[182] 


f@) = |= wt fe 3k 4, cain Sut diet | 
a 3 5 
which checks Eq. 172 for x = wt. The amplitude and phase functions are 


2 
len] = — (for 2 odd) [183] 
which may be written 
2 1 
la(nw)| = oe (for n odd) [184] 
and 
v 


Since the phase function is a constant, there is no need to plot it. The 
plot of the amplitude function 183 or 184 is facilitated through plotting 
first a continuous dotted curve for this function, assuming mw to be a 
continuous variable, and then erecting a set of ordinates from the mw-axis 
to this dotted curve at the points corresponding to odd integer values of 
n, as is illustrated in Fig. 11. The dotted curve in this exainple is simply 
a rectangular hyperbola. After this dotted curve is drawn, any number of 
lines are readily inserted. A plot of this sort gives one a good idea of the 
rate of convergence of the series, since the ordinates are proportional to 
the values of the various harmonic amplitudes. 

A second interesting example is given by the function shown in Fig, 12. 
This consists of a periodic succession of identical rectangular pulses of 
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unit amplitude and duration 5, with the time origin chosen at the center of 
one of them. Application of the formula 176 in this case yields 


: 6 
: sin % = 
5/2 —jnwt 5/2 
On = f evinat gy — © | = 2 [186] 
2rJ—s ne 


—6s /2 


/2 _ —2Qxrnj 


a 3 5 7 9 Mu 


Fic. 11. The amplitude function in the Fourier analysis of a rectangular wave. 


which is preferably written 


6 , 
7 5c as 5 3 Sue 1187] 
aa Rie 46 _ ee 
2 2 , 


Since the expression is real, the phase function in this example is zero, as 
is to be expected from the fact that the function f(#) is even. 


Fre. 12. A periodic succession of rectangular pulses. 


The amplitude function has the form (sin x)/x which is equal to unity 
at x = 0 and zero at integer multiples of +. The general appearance of 
this function is that of a harmonic oscillation of decreasing amplitude, 
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although it is, of course, not simple harmonic. At the point « = 7/2 the 
value is 2/x; at x = 3x/2 it is (1/3)(2/m); at x = 52/2 it is (1/5) (2/7), 
and so forth, the maxima and minima lying approximately at the points 
x = 3n/2, 5x/2, ---. With these items in mind a dotted curve showing 
the function 187 versus nw is readily plotted. It remains to draw in the 
ordinates corresponding to integer values of 2. Before this can be done 
one must choose a particular value for the ratio of the duration 6 of the 
impulse to the fundamental period 7. For the present example, this ratio 
is chosen as 


[188] 


Fic. 13. The amplitude function in the Fourier analysis of the succession of 
rectangular pulses. of Fig. 12. 


and since r = 2x/w, this choice corresponds to a fundamental angular 
frequency of 
me : 2a 
| eel hare [189] 

As shown in Fig. 13, the point mw» = 2/6 marks the first zero of the 
dotted curve, and since the fundamental frequency corresponds to m = 1, 
this point is located one-fifth the distance from the origin to the point 
mw = 2x/8. The various ordinates for other integer values of ” are then 
readily drawn. For the particular choice indicated in Eq. 188, the ampli- 
tudes of the fifth, tenth, fifteenth, - - - harmonics are zero because these 
points coincide with the zeros of the dotted curve. 

It is to be observed that the form of the dotted curve is the same 
regardless of the ratio 6/7, and that only the spacing of the lines in the 
spectrum is dependent upon this value. Incidentally, the amplitude of 
the constant component, corresponding to ” = 0, is seen to be the largest. 
All amplitudes are proportional to the duration 6, so that as the ratio 
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8/r is decreased, 7 remaining constant, the dotted curve remains intact 
except for a change in the scale factor for its ordinates. 


10. POWER PRODUCTS AND EFFECTIVE VALUES 


In the discussion of problems in electric circuit theory, determining 
the average value of the product of two periodic functions is at times 
necessary. Since this product is again a periodic function with no larger 
fundamental period, it is sufficient to restrict the evaluation to one period. 
If the two functions are denoted by /,(#) and f(t), the problem calls 
for the evaluation of the integral 


=. of " ful)fald) dt [190] 


This evaluation is particularly simple if the complex form is used for 
the Fourier series representation of the functions f;(¢) and f2(¢). These 
functions may be expressed as 


AQ = ZX aneinet [191] 
and > 
fa(t) = = Bnet [192] 
The product is given by the double sum 
fi) -fe® = 7 = abe tet | [193] 


Substituting this expression into the integral 190, and assuming that 
the series 191 and 192 are uniformly convergent so that the integration . 
may be carried out term by term, one has 


elo | 
= 2 EZ eabn i eiintm)at gy [194] 


Trnm=—2 
The integral involved here is a very simple one. Its evaluation reads 
2a 


ajo giintm)2n _ 4 — form=—n 
f ei(ntm)ot gy — —________ = {wo [195] 


i(n +m 
it ye - 0 form #~.—n 


‘ Consequently all the terms in the double sum 194 vanish except those 


for which m = —n. The function J, however, still requires an infinite 
sum for its representation, that is, 


n=—0 
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which is the desired result. It may be written in the alternative form 
J = abo + L (@nB_n + a_nBn) [197] 


or, if one writes, 
an = lan| ef? Bn oa \Bn| evn [198] 


the result 197 is found to be equivalent to 
J = apBo + 2 xu lanBrn| cos (n — én) [199] 


Because the integral 195 is zero for m * —n, the interesting result 
follows that the average value 190, given by 199, depends only upon the 
products of the amplitudes of harmonic components of itke order. In 
other words, none of the cross-product terms resulting from the product 
of the two series for f,(é) and f.() contributes to the average value of 
this product. In a physical problem in which f; (t) represents a source 
voltage and f2(¢) the resulting source current, J represents the average 
power delivered by the source. The integral 190 is, therefore, also referred 
to as the power product of two periodic functions. The coefficient 
cos (Wn — ¢n), which occurs in Eq. 199, is in electric circuit problems 
called the power factor corresponding to the harmonic of nth order. 

A very closely related problem is the evaluation of the integral 


[FOP at [200] 


® Qa feo 
Dar Jo 
which is referred to as the mean square value of the single periodic func- 
tion f(£). This is, however, merely the integral 190 for An® =f =f, 
and hence the desired result is expressed by Eq. 196 or 199 for a, = Bn. 
Hence, if f(¢) is assumed to be given by Eg. 191, 


J’ =a? +2 x lan|? [201] 


J’ 


The square root of this value is called the root-mean-square value or 
effective value of the periodic function f(t). Recalling, according to Eqs. 
94 and 135, that the harmonic amplitudes c, in Eq. 95 are given by twice 
the magnitudes of the complex coefficients a,, one sees that the effective 
value of the periodic function f(é) is given by the expression 


2 ° 
(f(t) Jettective = Vr + : x Ca [202] 


which contains the particular result that the effective or root-mean-square 
value of a sinusoidal function equals its amplitude divided by V2. 


. 
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11. SUMMATION FORMULAS 


It is useful to note that the Fourier series for various particular func- 
tions may be used to obtain in closed form the values for numerous 
special forms of infinite series. For example, from the function h(x) of 
Fig. 8 and its series 141, or from the function f(x) of Fig. 9 and its series 
172, one obtains by setting x = 7/2 the result that 


gy Pea cee are [203] 


Similarly, since the series 174 for the triangular function of Fig. 10 is 
still uniformly convergent for x = 0, one has 


rr 


1 1 1 
gp toe tT Gg [204] 


The evaluation of many other infinite series may be obtained in this way. 


f(x) 


a 


sin 
-T t 


Fic. 14, A plot of the function of Eq. 205, the Fourier series of which is used in 
several summation formulas. 


It is of greater practical value to recognize that similar means may be 
employed for the determination of more general summation formulas. 
For example, one may consider the function f(x) defined by 


fle) = sinu(E +a (for —r <x“ <0) 
[205] 


f(x) = sin u (5 — ) (for 0 <x < 7) 


in which » may have any value. This function is shown plotted in Fig. 14 
for the value » = 24. The function is even and satisfies the condition 
98. Hence its Fourier series is represented by a cosine series with odd 
harmonics, and the coefficients are given by the formula 105. Substituting 
from Eq. 205, one has 


ae ? [ie cos nx dx = 2 [sina - 2) cos na dx [206] 
x JO a3 Jo 2 
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_The integration yields 


: F T T ba 
2n sin nx sin pw be — 2u cos nx cosu{-— — x 


a, = a 
™ a(n? — pu?) 
T 
2u(1 — cos mr) cos wu 3 
r a(n — p) [207] 
or 
4u COS ps = 
Qn = (ne? — 4) (for n odd) [208] 
0 (for even) 
The Fourier series for this function is, therefore, given by 
bails 3 = cos nx 
je ae eee 209] 


which converges uniformly for all values of « since the function f(x) is 
continuous. Substituting from the relations 205, one may write for the 
intervalO <“%<-4 


. vie 
sin a 
c G ) > COs nx 


3 «C210 


© nmiasee 1 — 
4u cos p 2 


Particularly, for x = 0, this result yields 
8 eo, a So 
4u “a 2 n=1,3,5,<0° (n = py) 


which is the partial fraction expansion of the tangent function, and is 
found to be a useful summation formula in connection with a number of 
problems in circuit theory. 

Other formulas may be developed from the relations 210 and 211. 
For example, subtracting the former from the latter, and using the 
trigonometric identity 


[211] 


1 — cos nx = 2 sin? nS [212] 
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one finds 
finn — sn (5 -=)| nt 
mjsin » > — sin u a 2 sin’ m5 
Fux) = = 2 —z——, ~=C«(213] 
wv n=1,3,5,°°° n(n —p ) 
Su COS 15 


which is a somewhat more general summation formula than Eq. 211. 

Now writing the Eq. 213 first for » = m1 and again for u = ue, and 
subtracting the second from the first, give a more general formula which 
reads 


2, % 
F (u1,%) ae f(u2,%) = rE oe 2 [214] 
Ha? — pe” nates. (0? — m1") (2? — ps”). 
The quantities », 41, and pe in these formulas may have any real or 
complex values. 

Formulas having the form of Eqs. 213 or 214 are useful in electric 
circuit theory because the quantities (n? — ux”) have the form of the 
resonance factors of an impedance function. Since the latter is a rational 
function, that is, a quotient of polynomials, it is always factorable in 
terms of the roots of these polynomials. Hence the impedance is ex- 
pressible as the quotient of products of factors of this type. The expression 
for the power absorbed by an electrical network which is excited by a 

‘periodic source containing only odd harmonics (this case is most com- 
mon) assumes the form of the infinite sum in Eq. 214 when the network 
contains two degrees of freedom. 

For pe = 0, a special form of Eq. 214 results, reading 


* 9 « 
8f(u,x) — 7% ae > abs 2 [215] 
8p? n=ias}-. 1?(n? — p?) 


Other formulas may be obtained through differentiation of the preceding 
ones with respect to x. As long as the coefficients in the resulting series 
decrease for large values of m as rapidly as, or more rapidly than, the 
ratio 1/n?, then, according to Art. 8, the periodic function represented 
by the series is continuous, and the series still converges uniformly for 
all values of x inclusive of the boundary values in the relations 205. 
However, if the coefficients in the resultant series decrease for large 
values of 7 only as fast as the ratio 1/n, the series cannot be used for the 
boundary values of x, although it still converges uniformly for all other 
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x-values. For example, differentiating the formula 213 with respect to x 
gives 


nm sin nx 


[216] 


which is evidently no longer valid for « = 0 or x = x. However, differ- 
entiating the formula 215 with respect to x yields 


Ca) ee 

v BLS —-T Bo “ : 

2 2 - _ aa [217] 
Ay? T n=1,3,5,+°° n(n® — yp?) 
‘ OS 


which converges uniformly for all «-values. 

The formula 214 may be generalized so as to contain any number of 
factors (v2 — y,2) in the sum. Replacing w; and pa respectively by pe 
and pg and subtracting the resulting formula from 214 yield 


Sma ) + T (u2,x) f(u3,%) 
(ux? — 2”) (ui? — wa”) (ug? 41”) (27 —U3”) (ug? — #1”) (ug? — ug”) 
- sin? Pi 
me Be 


n=135,-++ (0? — 17) (Nn? ~ ps”) (n? — ys 


aay [218] 
Again, replacing u1, “2, 43 in this formula by ye, us, #4 respectively and 
subtracting the resulting formula from 218, one finds 


2 FT (un,x) 


> NE” 
wot (un? — mega?) (we? — mete”) (ue? — we4s”) 


ye sin? ns 
= Fo ooo —“‘é*_‘2119 
sce (m2 — py)(n? — ps”) (n? — 3”) (n® — ys”) i219] 


in which the subscripts & = 1, 2, 3, 4 are assumed to form a, cyclic group. 

Any desired further generalization is evidently possible. For example, 
by using Eq. 213, and forming the function af (u1,0) + df (u2,”), one 
obtains a formula which differs from Eq. 214 in that the terms in the sum 
contain a factor of the form (n? — y,”) in the numerator in addition to 
factors of this form in the denominator. 
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12. THE “LEAST SQUARES” APPROXIMATION PROPERTY OF THE 
FOURIER SERIES 


In this article the problem of finding a trigonometric series representa- 
tion for a given function f(x) is approached in a different manner. The 
trigonometric series which is to approximate the function f(x) is assumed 
to be finite and is written in the complex form 


Sn() = oa ayei** [220] 


The following question is now raised: What must be the values of the 
coefficients a; in order that the mean square error between f(x) and 
Sn(%) may become a minimum? In analytic form the 1 mean square error, 
expressed for the fundamental range, reads 


ee —f (f(x) — sn(x)]? dx [221] 


The problem, therefore, is to determine the coefficients in Eq. 220 in 


such a way that the expression 221 becomes a minimum. 
Now 


(f(x) — sn@)? = Lf@)P + [sa(@) — 2Lf(w)sn(~)] [222] 
According to Eq. 220, 
[sa(x)]? = z ajayerith2 [223] 
4k=—n 
so that 


at2r n +20 | 
7 of: [sn(x) |? dx = x _ 2 Hite f efitB2 dy [224] 


The integral appearing here has the same form as that in Eq. 195 of 
Art. 10. Hence 


a+ 2a n 
= f [sn(w)]? dx = Be ape [225] 
Qr a k=—n 


Next one needs to evaluate the integral 
a+ 2x n 1 fat2= : 
A esate) de = EB alt [peer as} (226 
TVa =—n a 


The quantity enclosed by the curved brackets is, according to Eq. 122 or 
Eq. 156, recognized to be the expression for the complex Fourier coefficient 
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op, so that 
1 a+ ln n 
A fen) de = Ean [227] 
Ta k=—n 


Substituting Eq. 222 into Eq. 221, and taking note of the results stated 
in Eqs. 225 and 227, one obtains for the mean square error the expression 


1 a 2 n n 
E=7 [f@)Pdxe+ DL agax~—-2 DL ayo, [228] 
Ta k=—n k=—n 
Now it is observed that 
2 : >» aair, = F > Bs (dpa + a_yor) [229] 
‘The last two terms in Eq. 228 may, therefore, be written in the form 
= (a,0_~ — Axa_~ — Ana, + ana_%) — = Oye_k [230] 


which is equivalent to 
= (ay — ax) (Gz — az) — i OR . [231] 


or, since the coefficients with negative subscripts are the conjugates of 
those with the same positive suhscripts, these terms are given by 


JE lee — aul? — Eau!” [232] 


Hence the mean square error, according to Eq. 228, is seen to be given 
by 
. UI ins 2 - 2 " 2 
B= J [f@)Pde+ ES lor —aal?— E loa|? [233] 


Inasmuch as the function f(x) and hence also the coefficients o, are fixed, 
the expression 233 evidently becomes a minimum for 


ay = OK [234] 


whence one may conclude that the sum s, («) as given by Eq. 220 approxi- 
mates the stated function f(x) over its fundamental range so as to make 
the mean square error a minimum #f the coefficients in the finite sum s,(x) 
are the Fourier coefficients for the function {(x). , 

When the coefficients a; are so determined, the mean square error 233 
becomes 


1 a+ lx n 
B= 5 [ [@)Par— Elo! [235] 
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which, according to Art. 10, is 
E= ZX lal? — ¥ loal? [236] 


Thus it is seen that the mean square error tends toward zero as n becomes 
larger and larger. In other words, the infinite Fourier series approximates 
the given function f(x) in such a way as to make the mean square error 
over the fundamental range vanishingly small. The Fourier series, or 
any of its partial sums, is, therefore, said to approximate a given function 
in the “ least squares ” sense. 

In view of this result it is easy to show why, at a point of discontinuity 
of the function f(x), the Fourier series always yields the arithmetic mean 
between the two values of f(x) immediately adjacent to the point of dis- 
continuity. Thus, if these two values are denoted by a and 4, and the 
value of the series at the point of discontinuity is denoted by s, the mean 
square error at this point is expressed by 


(s — a)? + (s — b)? 


5 [237] 


If this is to be a minimum, its derivative with respect to s must be zero, 
that is, 


$s—@+38.—b=0 [238] 
which yields 


a+b 
2 


[239] 


13. THE APPROXIMATION PROPERTY OF THE PARTIAL SUMS; THE 
GIBBS PHENOMENON 


When, in a practical problem, a trigonometric series is used to approxi- 
mate a given function, this series must for obvious reasons be a finite one. 
Consequently, it is of considerable interest to know more about the 
detailed manner in which the partial sums of a Fourier series approximate 
a prescribed function. 

* A partial sum reads 
n 
Sax) = 2 oye?** [240] 
k=—n 


in which 
1 ae 
a= 5 [ Toma [241] 
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Substituting this expression for a; into Eq. 240 and reversing the order of 
summation and integration yield 


se) => ['F@) E oo ae [242] 
S Qr ‘0 k=—n 
Now 
+= ee) = 14. cos (x — &) + cos 2(~ — £) + +++ + cos m(a — &) 
[243] 
and, according to Eq. 11, this gives 
e , sin | Qn + 12> 
fea) oo Ne 
2! 7 ) [244] 
sin\—> 


Substituting this result into Eq. 242, one finds that a partial sum may be 
expressed as 


se) = [Fue ade [285] 
in which 


sin | Qn + 1) i] 
u(é — 2) = —— G3 [246] 
sin(—> 

This function is shown plotted versus £ in Fig. 15 for the particular 
choice m = 5. It consists of a periodic succession of identical large humps 
with the amplitude (2m + 1) and the fundamental period 27, separated 
by a number of smaller oscillations which decrease in amplitude with 
increasing distance from the large humps, the smallest of these having 
unit amplitude and occurring midway between any two of the large 
humps. It is observed that only one large hump is encountered throughout 
the fundamental range 0 < x < 2m, and this occurs at. the point = 2. 
According to the relation 245, the partial sum s,(x) is given by the 
product of this peculiar function w(é — «) with the given function f(€) 
integrated over the fundamental range. The integration must be evalu- 
ated separately for each value of x for which the corresponding value of 
Sn(«) is desired. During each integration, x is treated as a constant. The 
function u(¢ — x), which contains « as a parameter, remains the same 
for various x-values except for a translation of this function as a whole. 
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The value of x merely determines the location of the large hump in the 
fundamental range. 

Figure 16 illustrates the function u(é — x) for a fairly large value of n, 
and shows its position in the fundamental range relative to the given 
function f(£) for some arbitrary x-value.* For the interpretation of 
the integration in Eq. 245 one may visualize the plane of Fig. 16 asa 
screen on which the function 2(¢ — x) is plotted and of which the portion 
corresponding to the shaded area under this curve is cut out. The func- 
tion f(£), which is plotted on a second screen located behind the first, is 


u(é-x) <—_—— width of fundamental range-————> 
I 


ee gq eq a a ee eee 


(2n+1) drawn for n=5 


fundamental range=27 


Fic. 15. The scanning function used in a study of the Gibbs phenomenon. 


assumed to be visible only through the portion of the first screen which 
is cut away. For a large value of 7, this cutaway portion is essentially 
in the form of a long narrow slit corresponding to the area under the 
large hump in the function «(¢ — x). If one neglects for the moment the 
small areas under the minor oscillations or ripples adjacent to this slit, 
and assumes further that this slit is sufficiently narrow so that the small 
portion of the function f(g) visible through it may to a first approxima- 
tion be regarded as characterizing the single ordinate of this function 
for the point £ = x, it becomes clear that the value of the integral 245 » 
is given simply by the product of f(x) and the area of the slit multiplied 
by 1/2. If 1/2 times the area of the’slit equals unity, the value of the 
integral, to a good approximation, equals f(x). In other words, one arrives 
at the reasonable conclusion that s,(”) & f(x). , 

*Since the ratio of the height of the large hump in the function u(é — x) to the width of 
the fundamental range is (2n + 1)/2z, it is not practically feasible to draw this figure, or 
Fig. 20, in correct proportion with regard to this ratio. These figures are, however, in correct 
proportion with regard to the ratios of the amplitude of the large hump to those of the adjacent 
oscillations, as well as with regard to the ratio of the width of the fundamental range to that 
of the large hump at its base. This ratio is equal to.» + yy. 
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The values of s,(x) for various values of x are obtained through dis- 
placing the first screen in the horizontal direction so that the slit uncovers 
different portions of the curve f(#) corresponding to the point § = x. 
This process of moving the screen with the slit corresponding to u(é — x) 
across the fundamental range, and viewing the function f(£) through it, 
is referred to as scanning f(£) with the function #(€ — x). The latter is 
called the scanning function. If this scanning function consists of a single 
rectangular hump of extremely small width but of such height that the 
area of the slit nevertheless is finite and equal to 27, the integral 245 
yields almost exactly the function f(«), the discrepancy becoming zero 
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Fic. 16. Pertinent to the evaluation of the partial sum of a Fourier series. 


as the slit width is allowed to approach zero. Actually, because of the 
deviation of the scanning function u(é — x) from this ideal limiting 
form, the integral yields a function s,(“) which is only approximately 
equal to f(x). 

For a relatively small value of (few terms of the Fourier series) the 
scanning function w(é — x) deviates quite considerably from its ideal 
form, and the partial sum s,(x), as one should expect, only crudely 
approximates the given function f(). As more terms are added to the 
partial sum, and m becomes relatively large, the form of the scanning 
function improves, and so does the degree of approximation between 
$,(x) and f(*). The improvement in the scanning function is due to the 
large hump’s becoming taller and narrower, as is clear from Fig. 15, in 
which it is‘indicated that the height of the large hump equals (2x + 1) 
and its width at the base equals 41/(2” + 1). At the same time, the adja- 
cent ripples become larger in number and hence also become narrower, 
in the same proportion, in fact, as the large hump becomes narrower. 

One disturbing feature about the behavior of this scanning function 
with increasing 7, however, is that the ripples immediately adjacent to 
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the large hump do not ultimately become smaller and smaller in amplitude 
relative to the height of the large hump but maintain amplitudes which 
asymptotically approach a finite ratio relative to the height of the large 
hump. The ultimate effect of this disturbing circumstance requires 
further careful investigation which is begun by study of the properties 
of the scanning function in greater detail. 

First it is significant to observe that the net area under the curve for 
u(é — x) over the fundamental range is indeed equal to 2x, and that this 
is true for any , as may readily be seen in several ways. For example, 
according to Eqs. 243, 244, and 246, 


u(é—x) =1+2 cos (E—«)+2 cos 2(E—a%) + +++ +2 cos n(~—x) [247} 


In evaluating the integral of this expression over the range 0<&<2r 
it is observed that all the integrals for the cosine terms yield zero. Hence 


Qa 2a 
cf ule 2) de = f de = 2 [248] 


This result may also be seen from the integral 245 through assuming as 
a special example that f(¢) = 1. Then s,,(x) must, of course, also equal 
unity; that is, the partial sum reduces to its constant term and this must 
have the value unity. The integral 245 then yields the result expressed 
by Eq. 248. 
~ “Thus it is established that 1/2 times the area under the scanning 
function is equal to unity, as the preceding discussion indicates that it. 
should be. However, this area is not confined to a single slit but is partly 
contributed to by smaller increments due to the ripples, which are alter- 
nately numerically positive and negative. A glance at the scanning 
functions shown in Figs. 15 and-16 reveals, moreover, that the net area 
contributed by the ripples is numerically negative so that the area under 
the large hump must exceed the value 27 if the resultant area under the 
entire function is to have this value. The amount of the excess area 
under the large hump depends upon the value of 2, but the important 
point is that this excess does not become zero as ” is indefinitely increased. 
In order to comprehend this fact, one must study the behavior of the 
function u(£ — x) for large values of n. To facilitate this study it is 
expedient to introduce in the form given by Eq. 246 the change of variable 


n= (n+ 1)55* [249] 


Then the scanning function assumes the form 
sin 4 


u(n) = ae) [250] 
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The region of the main hump corresponds to —# <7 < =, and each 
adjacent smaller hump corresponds to an increment of in the variable 7. 
As indicated in Fig. 16, the region over which the adjacent ripples have 
significant amplitudes is confined to the more immediate vicinity of 
the large hump. In other words, the entire region of interest for the 
function «(n) is confined to a finite interval for the variable » which, for 
example, may be described as —107 < 4 < 10x, or something of this 
order. 
With regard to the expression 250, if m is sufficiently large so that 
approximately 
107 T 
m+1~6 [251] 


little error is introduced in the determination of «() throughout the 
region of interest if the sine function in the denominator is replaced by 
its argument. In the limit 7 — © any error introduced by this procedure 


sin 7 


Fic. 17, A plot of the function which occurs many times in Fourier analysis. 


becomes vanishingly small. Hence, for large values of 1, one is permitted 
to write for the scanning function 


u(n) = On +1) (=) [252] 

The function (sin »)/n has appeared before in these discussions (see 
Fig. 13 of Art. 9) and will appear again in a variety of problems. The 
reader may, therefore, well spend some time becoming thoroughly 
familiar with its characteristics. It is plotted again in Fig. 17, which 
illustrates some of its more immediately apparent properties. At the 
origin, 7 = 0, the function has the value unity, and a zero derivative, 
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as is apparent if one writes the Maclaurin series for (sin n)/n, thus 


3 5 
nv 4 
sin 7 ™3t sr t 0? nt 
aoe 7 = =e ag [253] 
from which it is also clear that the function is even. At the points 
n= +n, +2x,+-- the function passes through zero. Aside from the 
maximum at 7 = 0, the function has further maxima and minima at 
the points where 
d ‘G ") 
—(—— }=0 254 
ace [254] 
or for 
tany = 7 [255] 


The roots of this transcendental equation are very nearly, and for suffi- 
ciently large » almost exactly, equal to 


peed [256] 


Actually they are slightly smaller than these values, but as a sketch of 
the relation 255 readily reveals, the discrepancies even for the first few 
roots in the set 256 are hardly noticeable in a graphical representation. 
The values of these approximate maxima and minima are most con- 
veniently expressed in terms of the value of the function for 7 = +7/2, 
which is 2/7. The approximate minimum at 7 = +32/2, which is a nega- 
tive value, is one third of 2/7; the maximum at 9 = +57/2 is one fifth 
of 2/x; and so forth. In other words, the maxima and minima in absolute 
value become smaller with increasing » like the ratio 1 /y. For example, 
the fourth extremum at 77/2 has the value 2/7x = 0.091, or only about 
9 per cent of the value of the function at 7 = 0. 

With regard to the area under the curve of Fig. 17 it is significant that 
the function 


Si (x) =< f i dn [257] 


which represents the area under the curve for (sin »)/» from the origin 
to some variable point 7 = x is a familiar function in Fourier analysis 
and allied matters. A short discussion of some of its properties may well 
be given at this time. 

The shaded area in Fig. 18 represents the value of the function Si (x), 
called the “ sine-integral of x,” for a particular value of x. The essential 
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properties of this function are easily obtained from inspection of this 
figure. 
First one may observe that for values of x near the origin, the area 
evidently increases very nearly, in fact for x = 0 exactly, as x. This 
circumstance may readily be seen by 
a observing that the function (sin 7)/n 
has a zero derivative (the curve is 
horizontal) at the origin. Hence 


As the curve for (sin 1)/n begins 
Fic. 18, The shaded area is the sine- to fall, however, the rate at which 
integral of x. Si (%) increases with x falls off, and 

it eventually becomes zero at x = 1. 

Here the function Si (x) reaches its first maximum. From x = 7 to 
x = 2x the Si-function decreases from its first maximum by the amount 
of the negative area enclosed by this portion of the (sin n)/n curve. By 


Si (x) 


y4-~-------- 


Fic, 19. A plot of the sine-integral of x. 


the same sort of reasoning, it is readily seen that the function Sz (x) 
has the general character shown in Fig. 19. The alternately positive and 
negative areas contributed by the ripples in the function (sin 7)/» become 
smaller and smaller with increasing x values, and the net area, therefore, 
approaches a finite value asymptotically and in an oscillatory manner. 
The asymptotic value is readily obtained from the result stated by 
Eq. 248, namely, that the net area under the curve for the scanning 
function is equal to 2x for any ” and any x. This equation may be written 


pS u(é — x)d(— — x) = Qe [259] 
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or, with the use of Eqs. 249 and 250, one has 


2 i alee /2 sin 7 
Qn +1J-@ntine , ( ” i = 2n [260] 
sin (| ——-—— 
2n+1 


In the limit 2 — o this result reads 
2 f a an mi Ox [261] 


and since the function (sin 7)/n is even, it follows that 
” sin 7 v 
A dyn = 5 [262] 
which is the asymptotic value of the function Sz (x), as indicated in 
Fig. 19. 

With regard to the evaluation of the integral 245 for large values of m, 
according to the scanning process illustrated in Fig. 16, several points of 
significance may now be brought to the reader’s attention. First it is to 
be observed from Fig. 17, which according to Eq. 252 represents the 
appearance of the scanning function for large values of m, that the smaller 
- negative humps immediately adjacent to the large hump have amplitudes 
which remain slightly greater than one-fifth the amplitude of the large 
hump no matter how large » becomes. As already pointed out, the 
scanning function, therefore, does not approach its ideal form as m 
increases without limit, since the net area under the scanning function 
does not become equal to the area under the large hump. The latter 
area is, for large values of ”, equal to twice the area under the curve of 
Fig. 17 between » = —a andy = x. (The factor of two enters here because 
of the change of variable given by Eq. 249, as is also seen in the steps 
leading to Eq. 261.) This area is equal to 4 Si (), which is found (from 
tables of Si-functions) to be about 18 per cent larger than 27. 

In the limit » —+ ©, the scanning function may be visualized as result- 
ing when all the side oscillations in Fig. 16, together with the large hump, 
are compressed so as to form a single ordinate of infinite height. The ex- 
cess area under the large hump then virtually becomes coincident with the 
residual areas of the ripples, and since the net area always equals 27, the 
ultimate form of the scanning function may in a sense be said to have met 
the ideal requirements. Yet for any finite 7, however large, there exists a 
departure from the ideal which has a definite effect upon the approxima- 
tion property of the partial sum s,(x). This departure becomes particu- 
larly marked in examples in which the given function f(x) possesses dis- 
‘continuities. 
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In order to illustrate this fact a function f(x) is assumed to be zero 


from x = 0 to x = x, and equal to unity over the remainder of the 
fundamental range from « = x, tox = 2x. The integral 245 then becomes 


1 
sult) = 5 fue — 2) at [263] 
This expression may be replaced by the two integrals 
_ 1 2a _ eS 4 = r 
sul) = yf we — 2) de 5 fuga) dg [264] 
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Fic. 20. The scanning function for » = 28 applied to a rectangular pulse. 


These manipulations are clarified by reference to Fig. 20, in which the 
scanning function for 7 = 28 as well as the particular function f(£) con- 
sidered in this example is plotted. Thus it becomes clear that the first 
integral represents 1/2x times the area under half the scanning function 
and, therefore, has the value 3. By use of Eqs. 249 and 252 in the second 
integral, the relation 264 becomes 


1 (2rtD@—) 2 sin 7 


1 
which, according to Eq. 257 is equivalent to 
ar ae 
Suv) = 5 + Silt 3) (@ — %)] [266] 


‘This expression shows how the discontinuity of the function f(*) at 
x= x, is represented by the partial sum s,,(«). The second discontinuity 
at x = 2m is represented in an exactly similar manner, with the result 
that the partial sum for this periodic function f(~) and the choice nm = 28 
assumes the form shown in Fig. 21. 

It must be admitted, of course, that a discontinuity in the function f(x) 
offers an extreme test of the ability of the partial sum s,,(7) to approxi- 
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mate this function. In other words, for a continuous function f(x), the 
partial sum s,,(x) with the same number of terms yields a vastly better 
degree of approximation. Nevertheless, it is practically significant to 
investigate the approximation property of the partial sum in the most 
adverse circumstances. In this respect one observes from the appearance 
of Fig. 21 that residual discrepancies remain even for very large values. 
of 2. As the latter is further increased, this figure is changed only in that 
the ripples in the vicinity of the discontinuities of f(x) show a propor- 
tionately increased rate of oscillation versus the variable x, whereas their 
amplitudes relative to the magnitude of the discontinuity remain the same. 


8(x) 


Fic. 21. The result of the operation illustrated in Fig. 20. 


‘In the limit m—» ©, these ripples are compressed into a single vertical 
line at the point of discontinuity, but even in this limit the Fourier series 
is still observed to yield the overswing of 18 per cent* which is charac- 
teristic of the function Si (x). It is true, of course, that in the limit 7 > © 
this overswing together with the adjacent ripples occupies zero space in 
the fundamental range, so that practically speaking one may say that no 
residual discrepancy between the series and the function remains. Never- 
theless, the phenomenon is noteworthy from a mathematical standpoint 
inasmuch as it illustrates the ultimate effect of the failure of the scanning 
function properly to approach its ideal form in the limit 1 —> ~. It is, 
moreover, of practical concern also since it reveals the disheartening fact 
that, by means of a finite portion of a Fourier series, a given function can 
never be approximated in the vicinity of a discontinuity with a tolerance 
less than the characteristic 18 per cent overswing, no matter how many 
terms one may be willing to use. 

This peculiarity of the Fourier series, referred to in mathematical terms 
as the Gibbs phenomenon, is a very real disadvantage with regard to 
certain practical problems. Because of this phenomenon, it is necessary 
in connection with certain approximation problems to use other types of 
trigonometric series which are appropriate modifications of the Fourier 


*This percentage is based upon half the value of the discontinuity. In terms of the whole 
discontinuity the overswing is 9 per cent. 
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series. An important modification of this sort is discussed in the next 
article. 


14, APPROXIMATIONS BY MEANS OF FEJER POLYNOMIALS 


The appearance of the Gibbs phenomenon in the Fourier series is evi- 
dence of the failure of the uniform convergence of that series in the 
vicinity of a discontinuity of the given function. In other words, the 
partial sums s, (x) no longer converge toward a definite limit with increas- 
ing for the immediate vicinity of a point of discontinuity of f (x). It is 
the object of the present article to show that this failure in the uniform 
convergence of the series, and the associated Gibbs phenomenon, may be 
removed if the sequence of the partial sums s,(x) is replaced by the 
arithmetic mean sequence defined by Eq. 146 of Art. 9, Ch. VI. 

This sequence reads 


So(x) = So(x) 
So(x) + s(x) 


Si(x) = 2 
So(x) + 51 (x) + So(x 
S.(e) =~ E sala) 
y+ 1n=0 ve 
Substituting from Eq. 240, and utilizing the notation 
en rae [268] 


one recognizes that this sequence may alternatively be expressed as 


So(x) = ao = oy 

S1(x) = ag + $e, cos (x + $1) 

So(x) = ag + 3c, cos (% + $1) + 42 cos (2% + do) . [269] 
y—1 y—2 

S,(x) = a9 + —— @ cos (x + $1) + —— ¢2 cos (2” + 2) 


1 
+ +++ =, 008 (re + $y) 
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The coefficients a, are the Fourier coefficients defined by Eq. 241, and 
hence the partial sums appearing in Eq. 267 are, according to Eqs. 245 
and 246, given by 


aman 55 5 =} 


F {! = = 

sin )~> 
For the following manipulations it is now useful to observe that 
sin | (2n + 1) iS +I sin | Qn +1) 3 =} sin (7 7 *) 


| sin & *) sin? ‘ou 5 *) 


_ cos n(é — x) — cos [(m + 1)(& — «)] 


a) i—*) 
2 sin? ( 2 


By use of a 270 in the last of the relations 267, it is found that 


sate) = = [" Fe: “de [270] 


[271] 


S(%) = aol fOE z cos n(t — x) — cos [(n = ae oN) ae 
2 + 1) sin “(i 2 ) 
[272] 
But 
35 {cos n(—x) — cos [(n+1) (€—«)]} =1— cos (E—x)+ cos (E—x) 
— cos 2(é — x) + cos 2(—E — x) — -+- — cos (» + 1) (E — x) 
=1—cs(v+1)(—2) = 2 sin? | ( + 1 §=*| [273] 


Hence Eq. 272 becomes 
, : 1 2x 
Sy) = 5 [I@) - 006 - 2) at [274] 


: x)\? 
1 sin | 


a eo mee). 
2 


in which 


[275] 


The expression 274 for the trigonometric polynomial S,(x) is identical 
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with the form 245 for the partial sum s,(«) except that the scanning 
function now is v(~ — x), as given by Eq. 275, instead of u(é — x) as 
given by Eq. 246. Comparing the function «(§ — x) with v(é — «) shows 
that the latter is essentially the square of the former. With » = 2n, this 
simple relationship between u(£ — x) and o(§ — x) is true except for the 
factor 1/(v + 1) in the expression for v(é — x). But for this scale factor, 
the plot of Eq. 275 for» = 10is simply given by the square of the function 
u(é — x) plotted in Fig. 15 for m = 5. The result is illustrated in Fig. 22. 


4a | ; 
vei 


fundamental range=27 


Fic. 22. A plot of the scanning function that results when Cesaro summation is 
; applied to a Fourier series. 


A comparison of Figs. 15 and 22 clearly reveals the superiority of the 
scanning function v(t — x) over the function «(€ — x). Most significant 
in this respect is that the present function has no negative values, from 
which it follows that, at a point of discontinuity of the function f(x), the 
partial sum S,(x) cannot overswing. Moreover, the ripples adjacent to 
the large hump in the function v( — «) have considerably smaller 
amplitudes than those for the function u(é — x). 

The net area under the scanning function »( — x) for the fundamental 
range is again equal to 27, as may most easily be seen through considering 
the integral 274 for the special case f(t) = 1, whence S,(x) must also 
equal unity. Since the function v(f — x) has the period 2z, it is thus 
established that 


- 2 
1 sinfo +1) £5" | 


Qa(v + 1) —* ss ( = *) 
sin 
2 


dé =1 [276] 


for any x and any ». 
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With the change of variable 


n= @ +1454 [277] 
the function 275 becomes 
_ 1 sing \? 
vn) = 5 2 ner" [278] 
sin © 4 


For large values of » the scanning function is, therefore, very nearly 
given by 


H 2 
o(n) = @ + 1) (= *) [279] 
The integral 276 then reads 
Ln ea et “y _ 
Z Bana ; dn =1 [280] 
and for » — o one has 
1 ¢* /sinn\? 
1 (= ") 24 [281] 
TJS—« ” 


From Eq. 279 it is found for large v-values that the area under the 
large hump in the scanning function »( — x) is equal to approximately 
90 per cent of the total area of 27, and that the combined area under the 
two small humps on either side is about 5 per cent of the total area. 
The amplitude of the small humps immediately adjacent to the large one 
is about 4.5 per cent of the large amplitude. These ratios hold for reason- 
ably large finite values of v (about 10 or more) and do not vary as » is 
increased without limit. 

With this scanning function, the function f(é) of Fig. 20 yields a 
partial sum S,(x), according to Eq. 269, which has the form shown in 
Fig. 23. In order that this figure may correspond to the same width of the 
large hump and of the adjacent ripples relative to the fundamental range 
as for the scanning function of Fig. 20, it is drawn for » = 56, which is 
double the 2-value chosen for Fig. 20. In other words, it is necessary to 
have twice as many terms in the partial sum 269 as in the partial sum for 
the Fourier series in order to obtain the same slope at the points of dis- 
continuity of the function f(x). However, the Gibbs phenomenon is now 
completely suppressed. Instead, the increment in S,(x) near the point x1 
equals only about 90 per cent of the value of the discontinuity, corre- 
sponding to a displacement of the scanning function »(¢ — x) by an 
amount equal to the width at the base of its large hump. A succeeding 
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- displacement equal to the width of the next adjacent small hump yields a 
further increase in S,(x) of about 5 per cent, and so on. 

The essential difference between the behavior of the partial sum 

S,() and that of s,(x) in the vicinity of a discontinuity of the function 


Sy (x) 


Fic. 23. The result of scanning a rectangular pulse with the scanning function of 
Fig. 22. 


f(x) may be seen from a comparison of the function Si (x), defined by 
Eq. 257 and illustrated in Fig. 19, with the function 


O(x) = f ; (=2) dn [282] 


The latter is shown plotted in Fig. 24, in which the dotted curve rep- 
resenting Si (x) is also drawn in order to facilitate the comparison of 
these two functions. They both have the same asymptote, but Si (x) 
converges toward this value in an oscillatory manner, whereas the 
function Q(x) approaches it monotonically. 

The conclusion is that the so-called Fejér sum S,(x), giver: by Eq. 269, 
converges toward a definite value as » is indefinitely increased, even at 
points where the given function f(x) is discontinuous. In other words, the 
Fejér series, which is given by S,(x) for v > ©, converges uniformly over 
the entire fundamental range even when the function f(x) which it 
represents possesses discontinuities. 

It is interesting to observe, according to the relations 269, that for 
large values of v the coefficients in the sum S,(x) differ appreciably from 
those in the sum s,,(«) only for the higher harmonics. In other words, the 
coefficients of the initial terms are practically the same in the Fejér sum 
as they are in the partial sum of the corresponding Fourier series when 
the total number of terms is large. This fact means that as the number 
of terms becomes infinite, the coefficients for any finite number of terms 
are actually identical. The two resulting infinite series differ only in their 
terms of infinite order.* 


*This is admittedly a rather loose way of referring to the higher order terms in a partial 
sum as the number of terms in that sum is allowed to increase without limit. 
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From a practical point of view this circumstance may at first sight 
seem trivial, inasmuch as the terms of infinite order can never be reached 
in any term-by-term calculation. Nevertheless, there remains a significant 
difference in the behaviors of the two series, in that one of them exhibits 


a Set Mees 


Fic. 24, Essential difference in the behavior of the partial sum of a Fourier series 
and that of a series of Fejér polynomials in the vicinity of a discontinuity. 


the Gibbs phenomenon at any point of discontinuity and the other does 
not. This difference is due precisely to the difference in the terms of 
infinite order because these are the ones which alone are significant in 
determining the approximation properties of the series in the vicinity 
of a discontinuity. 


15. FouRIER: ANALYSIS BY GRAPHICAL MEANS 


In practice it frequently occurs that the function f(x) for which a 
Fourier series representation is wanted is available in graphical form 
only. Usually also in problems of this sort only a partial sum s,(x) is 
sought which approximates the function f (x) with a certain stated 
tolerance. Before discussing a possible method of solution, it may be well 
to point out that the precise requirements of the desired solution in a 
problem of this kind are frequently not clearly stated, and hence that 
considerable confusion regarding the value of a particular solution may 
result. 

The problem is frequently put very roughly in the statement that 
a finite trigonometric polynomial of the form of s,(x) is sought which 
approximates a given function, but usually nothing is said about the 
approximation properties of the desired polynomial except perhaps that 
the “ best approximation ” which a given number of terms can yield is 
wanted. Inasmuch as there are an infinite variety of ways in which a 
trigonometric polynomial with a given number of terms may approximate 
the required function f(«), the solution is decidedly not unique. For 
example, the polynomial may approximate f(x) so as to make the mean 
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square error a minimum, in which case the solution, according to Art. 12, 
is given by the partial sum of a Fourier series. On the other hand, the 
polynomial may behave in such a way that all the maxirnum discrep- 
ancies which occur between certain of its values and the corresponding 
ones of f(x) are equal in magnitude. In other words, all the maximum 
deviations of the polynomial from the function f(«) are equal. In this 
case the polynomial is said to approximate f(x) with a uniform tolerance. 
This sort of behavior generally yields the smallest tolerance for a given 
number of terms, and it may quite appropriately be said to represent 
a “best approximation,” yet the mean square error does not become 
a minimum. Another reasonably good approximation may be had from 
the Fejér sum S,(x). For it the mean square error is not a minimum 
either, but this should not preclude consideration of it, because there 
usually is no particular reason why a minimum mean square error should 
be the ruling requirement. In other words, since there are numerous 
other approximation behaviors which with equal or better right may be 
regarded to be “ good ” or “‘ best approximations,” there is no a priori 
reason why a Fourier sum should be looked upon as the solution. In fact 
there may be good reasons in ‘certain problems why the Fourier sum 
should be avoided, one such reason being that the Fourier sum yields 
a nonuniform tolerance in the vicinity of a point of discontinuity. 

It is important to remember in connection with these thoughts that 
such a variety of possible solutions is markedly different only when the 
total number of terms considered is not too large, although there are 
always significant differences in the terms of high harmonic order no 
matter how large » may be. This point is illustrated by the comparison 
of the functions s,(x) and S,(x) given in the preceding article. In the 
limit 2— ©, all possible methods of approximation which converge 
toward f(x) must yield identical terms for all finite harmonic orders, 
but for any finite 7 the higher order terms may vary significantly, for 
these ultimately characterize the respective approximation properties of 
various types of partial sums. 

In practice these considerations may raise the question of what har- 
monic amplitudes one may expect to obtain from experimental measure- 
ments made on an unknown source with a detector which is almost 
perfect with regard to harmonic selectivity. Experimental errors being 
neglected, the answer to this question is simple inasmuch as a physical 
source cannot generate harmonics of infinite order and hence its periodic 
function must be given by a finite sum. The perfect detector measures 
the amplitudes of the harmonics in this finite sum. 

Because of the imperfections present in any practical measuring 
system, however, the results of such a measurement yield a partial sum 
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which is not correct but merely approximates with a certain tolerance 
the resultant function represented by the correct one. Since there are 
any number of other partial sums which also approximate the given 
function with the same tolerance, it is difficult to know how to interpret 
the measured values. 

It is common practice to assume that the measured polynomial approxi- 
_ mates the partial sum obtained from a Fourier analysis (or what is 
thought to be a Fourier analysis) of the resultant function f(x) as recorded 
by an oscillograph. One frequently reads in reports on experimental 
work of a comparison between “ calculated ” and “ measured ” values 
of harmonic amplitudes. Such comparisons are somewhat meaningless, 
not only on account of the residual inaccuracies in the calculated and 
measured values, but primarily because there is only a remote chance 
that they correspond to the same approximation behavior in the partial 
sums which they represent. 

For example, the given source function may closely resemble the 
rectangular wave of Fig. 9. Now if this function is approximated by 
a finite trigonometric polynomial in such a way as to yield a uniform 
tolerance, a relatively small number of terms (for example, about 10 or 
15) suffice to reproduce the function with residual discrepancies which 
in ordinary circumstances are well below the threshold of detection on 
a cathode ray screen or on a graphical plot, yet the resulting harmonic 
amplitudes are quite different from those obtained from a graphical 
Fourier analysis of the same oscillographic record with a consistent 
degree of care and accuracy. They also are different from the amplitudes 
obtained from the determination of a Fejér sum which approximates 
this record with tolerances that, for the same graphical or experimental 
accuracy, cannot be detected. These differences are more marked in 
the higher harmonics, but in some cases are noticeable in the lower 
harmonics and even in the fundamental. 

Who can say, then, that the measured harmonic amplitudes. should be 
compared with those of a Fourier sum and not with those in other finite 
approximating polynomials? One may arbitrarily choose the Fourier 
sum as a standard of comparison, particularly since determination of it 
ordinarily involves far less computational labor, but the mistake should 
not be made of regarding this sum as the “ true ” analysis of the experi- 
mentally given function. 

The Fourier analysis of a graphically given function proceeds according 
to principles already discussed in Art. 2. They are now set down in-a 
somewhat more detailed form. 

The given function f(x) is assumed to be plotted for the range 
—x <x <7. The first step is to decompose the function into its even 
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and odd. components f;(«) and fe(x) respectively according to the rela- 
tions 80 and 81. The Fourier series for these components read 
fila) = ao + a1 cos x + ae cos 2% + ag cos.3% + +++ [283] 
fo(x) = by sin x + bz sin 2x + bs sin 3% +--- [284] 
and the corresponding partial sums are written , 


sy -1(4) = do + a, cOS4++++ + Gn, 008 (4 —1)e [285] 


s®,,(x) = b, sinx + by sin 2x +--- + db, sin nx [286] 
The problem is to determine the coefficients in these partial sums so that 
SO na (") & fi(@) [287] 
s,, (x) & fo(a) [288] 
According to Art. 2, the range 0 < x < 7 is divided into ” subranges, the 
centers. of which correspond to the x-values 0 < 4, < 42g < +++ <4, <7 
with 

_ hls [289] 

With the coefficients 
Oke = COSS*, and Bye = Sin sxy [290] 


as defined by Eqs. 52 and 53, the problem reduces to a determination 
of the solution of the systems of linear algebraic equations 


= D eyes = = fi (xx) (k = 1, 2,°°° n) [291] 
and 
= Beads = fo (xu) (k = 1,2,-++m) [292] 


Multiplying these equations respectively by a;, and Bir and. summing 
over & yields 


n—1 z= 
>» = | a, = Xx anf 1(%z) [293] 
8=0 k=l 
and 
ze z PieBs b, = ya Brsfo (xn) [294] 


In view of the orthogonality conditions stated by Eqs. 56 and 57, one 
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finds 
ma. forr=1,2,---"—1 1295) 
n _ r —) 5 f oe nN _— 
Zour (xp) = 42 
nay, forr = 90 
and 
nN 
n ~6, forr=1,2,-::n-1 
z Brrfa(xn) = 42 [296] 
no, forr=n 
Observing that 
Bun = Sin nx, = sin (k — A) = (-1)F1 [297] 
and that 
apg = 1 | [298] 
Eq. 295 gives 
2 n 
a= 2 aad) Lem) 1295) 
and in particular 
1 n 
ao = nol (an) [300] 
whereas Eq. 296 yields 
2 n 
b, a “a ZX Burfa (te) (r = 1, 2, oe SS 1) [301] 
with 
_1¢ kt . 
by == (-1) Yale) [302] 


The last four equations constitute the desired solution. The values 
of f, (xz) and fo (xx) for k = 1, 2,---m are taken from the graphical plots 
for these functions, and the values of the coefficients a,, and fxr are either 
calculated from the relations 290 or read from graphical plots of the sine 
and cosine functions, it being noted that the arguments of these functions 
are (2k — 1)sm/2n. Thus all the values of the coefficients a,, and By, are 
readily taken from a pair of carefully plotted curves for the sine and cosine 
functions over a 90-degree interval. In this way the solutions given by 
Eqs. 299 to 302 are quite rapidly evaluated for reasonably large values 
of n. , 
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It should be observed that the values of the coefficients a, and 6, thus 
obtained, when substituted into the partial sums 285 and 286, yield func- 
tions which agree respectively with f; (~) and fo (x) at the points x1 --- 2%, 
and, of course, also at the points which correspond to the negatives of 
these x-values. Over the small ranges between these points the values of 
the partial sums do not agree with the corresponding ones of the functions 
fi(%) and f2(x), but if m is chosen sufficiently large it is reasonable to 
expect a fairly good approximation between the partial sums and the 
functions which they are to represent. 

If the functions f;(x) and fe(x) are relatively smooth, the points 
%1°-+*%», may be spaced farther apart; if the given functions are very 
irregular, the spacing of these points must be smaller. At all events the 
spacing must be sufficiently close to take account of the most rapidly 
varying portions of the given functions; otherwise such variations cannot 
be expected to be even approximately reproduced by the resulting partial 
sums. 

Of the nature of the resulting approximation not much can be said 
inasmuch as this depends largely upon the particular characteristics of 
the given function and the chosen spacing of the points x; - + + x,. In order 
to conserve computational labor, this spacing is ordinarily chosen as large 
as possible, in which case the coefficients in the vicinity of the terms of 
highest order cannot be expected to be even approximately equal to the 
true Fourier coefficients. However, in view of the discussion given earlier 
in this article, it does not necessarily follow that these coefficients are 
practically of less value than the others. 

In various practical problems, methods of determining finite trigono- 
metric polynomials which exhibit controllable approximation properties 
would be highly desirable. Of particular interest in this respect would be 
a method of determining a polynomial which approximates the given 
function with a uniform tolerance, since this type yields the closest 
approximation for a given finite number of terms. Such methods are 
desirable not only for use with graphically given functions but also for 
analytically given ones as well. Unfortunately, these questions have as 
yet received little attention. 


16. RELATION TO THE BESSEL FUNCTIONS; SOMMERFELD’S 
INTEGRAL 


The problem of determining the spectra of frequency or phase-modu- 
lated sinusoidal time functions, which occurs in the consideration of 
communication signals for which this type of modulation is used, reduces 
to the determination of the Fourier series representation for the functions 
cos (p sin x) and sin (p sin x), in which p is a parameter. Since these are 
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the real and imaginary parts of the function a 
f(x) = e** = cos (p sin x) +f sin (p sin x) [303] 


one may obtain both the desired Fourier series by considering this com- 
plex function alone. Incidentally, it is to be observed that this is a complex 
function of the real variable x and not a function of a complex variable. 
No extension of the Fourier series to functions of a complex vanes is 
involved here, although such an extension is possible. 

The Fourier series for the function 303 is advantageously written in 
the complex form 


f(x) = EL anei™* [304] 


n=— 0 


in which 
=P [liec™ Hiak as [donee d 305 
"Qa do Ie Jo i [305] 
This integral is a special form of the more general complex integral 
Zy(o) = + fetomicint = ag [306] 


which Sommerfeld has shown* to be capable of representing cylinder 
functions of all kinds according to the specific choice made for the path 
of integration in the complex ¢-plane. 

For the present problem it is sufficient to observe that the form which 
the integral 306 takes for the representation of cylinder functions of the 
first kind (Bessel functions), with the order p equal to an integer 1, is 
(except for a factor 4) identical with the integral 305. This function is 
usually denoted by Jn(p), thus, 


1 ve at 
On = In(p) ae f gi (e sin z—nz) dx [307] 
21 Jo 


The restriction, that this integral representation for the Bessel function 
is valid only for integer values of , is not violated in the problem of 
Fourier expansion considered here. 

Numerous variations in the form of the integral 307 are possible, a few 
of which may be worth mentioning. For example, making the change of 
variable x —> x + 2/2 and noting that the limits on the integral may, 
because of the periodicity of the integrand, be changed arbitrarily as long 


*Math. Ann., 47 (1896), 335. Also in Riemann-Weber, Differentialgleichungen der Physik, 
Vol. II, p. 454 (Vieweg, 1927). 
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as the integrationgxtends over the fundamental range of 27, one obtains 


2x 


1 ; : 
Jue) = ae gene TE) ge [308] 


or, replacing x by —x (including dx by —dx) and noting that the limits 
become 0 to —2z, but that 


—Qar 7 
ie ie | _ bel 
one may also write 


ar Q - 
In(p) = ~S eh a kyinie—s i) Pe [310] 


which is the form obtained from the general integral 306.* If it is observed 
that 


eri ays [311] 
this last form is seen to be expressible as 
jy fan. P 
Hija! = t gi? On zeine iy [312] 
2r vo 
Noting that 
EF OB Heinz — IP OBE cos yy 4 Fei? 8% cin ny [313] 


and that the first of these two terms (as a function of x) is even while 


_ the second is odd, one finds that the integration in Eq. 312, which may 


alternatively be extended over the range —z to «, yields nothing for 
the second term and that the result for the first term may be written 
in the form ; 


Jilp) = (a A e7? 8% cos nx dx [314] 
Tr 

Applying a similar process of manipulation to the integral 307, in which 

the terms of 


ef sin2—n2) — cos (p sina — nx) + jsin (psinx ~ nx) [315] 


are observed to be respectively even and odd, one finds still another 
representation which reads 


1 
In(p) = -[ cos (p sin x — nx) dx [316] 


*The Bessel function Jp(o) may be expressed as Jp(o) = {Hp (p) + Hp (p)] in 
which the Hankel functions Hp“ (p) and Hp (p) are given by the integral 306 for particular 
paths in the ¢-plane, and the Bessel function J. p(p), through choosing the resultant of these 
two paths. A more detailed discussion is given in Art. 26. 


} 
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From this form it is easily seen that J,(p) is real for real values of p. 
The form given by Eq. 314 is convenient for establishing the relation 


J_n(e) = (—1)"Jn(e) [317] 


The Fourier series representation for the function 303 is, according to 
Egs. 304 and 307, given by 


erting =. Ta(p)ein - [318] 


n=—o 


Writing out the indicated summation, one has 


ef8in= — Jo(p) + 2[Jo(p) cos 2x + J4(p) cos 4x +---] 
= 2j[Ji(p) sin x + J3(p) sin 3x +--+] © 


a WE 
Zeek eee saeee 
ae 
ee ole Nee eye 
Sees RRS AZSE 
PE NZ a eed 
CEPR 
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Fic. 25. A plot showing the closeness of approximation to the Bessel function of 
order zero by the function of Eq. 322. 


- 0.6 


Separation into real and imaginary parts yields 
cos (p sin %) = Jo(p) + aL To(p) cos 2x + J4(p) cos4a ++++] [320] 
and : 
sin (p sin x) = 2[J,(p) sin x + Js(p) sin 3a +--+] [321] 
which are the desired relations. 
Regarding the numerical calculation of the coefficients in these Fourier 


series, it is useful to observe from the plots in Figs. 25 and 26 that the 
functions Jo(*) and Ji(x) are approximated sufficiently closely for 
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most engineering purposes by the expressions 


eee (« a *) 
08). [322] 
2 x 
and 
= 
¥ sin ( = *) 
1(4) & ——=— [323] 
5 x 


for values of x larger than about 4. In connection with the frequency or 
phase modulation problem the arguments of the Bessel functions are for 


‘. us 
sin (x— 4) 


asa (2) 2 Tas = 
V2 


a NIA la 
Ei a = Dm 


1 2 3 4 5 6 7 8 9 io UU 2 2B 14 


Fic. 26. A plot showing the closeness of approximation to the Bessel function of 
order one by the function of Eq. 323. 


the most part at least as large as this value or much larger, so that it is 
seldom necessary to consult the tables or curves for these functions. 
The functions of higher order are readily calculated in terms of Jo(x) 
and J,(x) from the recursion formula* 


FeasaS - Lia Tease [324] 


*The derivation and discussion of this formula may be found in various well-known books 
on the subject of Bessel functions. The approximate expressions 322 and 323 are derived 
in Art. 26. 
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17. FouRIER SERIES IN TERMS OF MORE THAN ONE VARIABLE 


The Fourier series representation of a periodic function may readily 
be extended to functions of more variables. For example, if f(x,y) is 
periodic in both the variables x and y with the fundamental ranges 
0 <x S 2r and 0 S y S 2z, and if, throughout the rectangular region 
thus defined, the derivative 


9? 
Ox r : [325] 
is finite and continuous, the series 
fey = XL ap,ierm [326] 
v= 2 


is absolutely and uniformly convergent throughout the stated region 
and there represents the function f(x,y) in the Fourier sense. 

As in the case of the Fourier series for a function of a single variable, 
the derivative 325 or the function f(x,y) need not be continuous at all 
points of the region. The representation 326 is still possible as long as 
the Dirichlet conditions (see latter part of Art. 7) are satisfied in both 
the variables, but the series is then no longer uniformly convergent 
throughout the entire fundamental region. 

The coefficients in the complex series 326 are given by the formula 


ee ae ee f ss = juste) 
awa f &-f aiese [327] 


which is a straightforward extension of.the formula applying to functions 
of a single variable. Derivation of it may be assumed to proceed by con- 
sidering the function f(x,y), for the moment, for a particular value of y. 
That is, for y = constant, f(x,y) is a function of x only, and a Fourier 
series representation is possible in the form 


f(xy) = x= A, (y)e%#* [328] 
“a =_— oO 
in which the coefficients 


Qa ; 
Any) = 52 [fone de [329] 


are functions of the parameter y. 
For any integer value of u, A,(y) may be represented by a Fourier 
series in the variable y, thus 
Au) = ZX awe” [330] 


p= 


' 
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with coefficients 
1 ar . 
oer = 5 f A,ly)eo" dy [331] 


Substituting Eq. 330 into Eq. 328 yields the double sum 326, which is 
the desired expansion for f(x,y); and substituting Eq. 329 into Eq. 331 
yields the formula 327, for the resulting Fourier coefficients. 

Thus the Fourier expansion may readily be extended to functions of 
any number of variables. 


18. FREQUENCY GROUPS 


In the present discussion a finite group of simple harmonic time func- 
tions (frequency components) is assumed to be given, and an inquiry is 
directed toward determining the nature of the function defined by linear 
superposition of them. Although this problem has much in common 
with the consideration of the properties of the partial Fourier and Fejér 
sums discussed in Arts. 13 and 14, the present interpretations are directed 
toward an entirely different goal, namely, toward a means for represent- 
ing, in a closed form, functions which are not necessarily periodic yet 
for which the defining range extends over the entire region of the inde- © 
pendent variable from minus to plus infinity. 

To begin with a simple example, it is assumed that three simple 
harmonic functions with any finite amplitudes are given, having the 
frequencies 100, 125, and 150 cycles per second. The relative phase 
angles are for the moment immaterial. The resultant function given by 
the sum of these three components is periodic, its fundamental frequency 
being 25 cycles per second. This conclusion is clear from the fact that 
25 is the highest common factor (HCF) of the group of numbers 100, 
125, and 150. The fundamental period is one-twenty-fifth of a second. 
Throughout this interval the 100 cycles per second component completes 
4 cycles, the 125 cycles per second component completes 5 cycles, and 
the 150 cycles per second component completes 6 cycles. The original 
state of affairs is then re-established because each component has com- 
pleted a whole number of cycles. This ‘statement is true no matter what 
the relative phase angles of the components may be. 

In the language of the Fourier series, the resultant function is repre- 
sented by its fourth, fifth, and sixth harmonic components alone. All 
other components, including the fundamental, are absent. It becomes 
clear that the linear superposition of a group of simple harmonic com- 
ponents yields a periodic function only if their frequencies have a com- 
mon measure. This means that the frequencies must be given by rational 
numbers or be rational multiples of the same irrational or transcendental 
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number. If several different irrational numbers (like 2) or trans- 
cendental numbers (like or the naperian base e) are contained in the 
group of frequencies, the resultant function never exactly repeats its 
sequence of values; that is, its period is infinite. The same is true if 
rational, irrational, and transcendental numbers in any combination 
are contained in the group of frequencies. 

For example, the function 


S(é) = cos 100¢ + cos (100 + w)é [332] 


never repeats its sequence of values. This situation should not be con- 
fused with the well-known circumstance that the function given by 
Eq. 332 can be interpreted as a beat phenomenon through conversion 
of the right-hand side of this equation to the form 


f® = 20s : t+ cos (100 + *) t [333] 


which is customarily plotted through considering the slowly varying 
function 2 cos (#/2)¢as an envelope containing the rapidly varying simple 
harmonic function cos (100 + #/2)#. Each half period of the function 
cos (x/2)é is commonly referred to as the “ beat period,” but this is a true 
period only if the difference between the two frequencies involved in the 
expression 332 is at the same time their highest common factor. 

Even when the two frequencies are rational numbers the beat period 
is not necessarily a true period. For example, the frequencies 100 cycles 
per second and 103 cycles per second give rise to a beat frequency of 
3 cycles per second, but the true fundamental frequency is 1 cycle per 
second. In other words, the exact pattern of the resultant function does 
not repeat until three beat periods have elapsed. The well-known experi- 
mental fact that the human ear (under proper circumstances) appears to 
recognize the beat frequency as though it were actually present in the 
form of a separate component is a physiological phenomenon due in 
part to a nonlinear characteristic in the response mechanism of the ear, 
and this is not to be confused with the strictly linear superposition of 
component frequencies considered here. 

It is interesting as well as instructive to generalize the problem of the 
simple beat phenomenon by inquiring how the interference pattern 
looks when more than two frequency components are superimposed. 
‘Incidentally, one must recognize that the familiar beat pattern in the 
case of two frequencies is pronounced only when the increment between 
these frequencies is small compared with either one. When the two 
frequencies and their difference are of the same order of magnitude, the 
conversion from the form of Eq. 332 to that of Eq. 333 is, of course. 
still valid, but the two functions whose product is represented by Eq. 
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333 then vary at about the same rate and the beat character of the result- 
ant function is lost. 

In generalizing the problem of the beat phenomenon it is, therefore, 
essential to assume a group of simple harmonic components whose spacing 
in the frequency spectrum is small compared with the mean frequency 

of the group. The line spectrum of 

>Su}< such a group containing seven com- 

Soto ht cr ponents is illustrated in Fig. 27. The 

1 mean angular frequency is wo. The 

adjacent frequencies are wo + dw, wo 

— 6, 9 + 26w, wo — 25, +++ and 

so forth, their uniform spacing 

Fic. 27. The line spectrum of a fre- being equal to éw. 

quency group. In general, a frequency group of 

this sort is considered to consist of 

n components, and for simplicity all the amplitudes are assumed to be 

equal and all the phase angles zero. It is further expedient to set the 
common amplitude equal to 1/n. The width of the group is defined as 


Aw = niw [334] 


The analytic expression for the group then reads 


COS wot + COS (wo + dw)t + COS (wo + 2dw)t 
4-4 008 (0 + 25 40) 


1 2 
f@) =- [335] 
2) + cos (wo — dw)t + cos (wo — 2dw)t 
—1 
+++ 008(00 =" 9 iw) ( 
By repeated use of the trigonometric identity 
cos (a + b) = cosacos b + sina sin b [336] 
this expression may be put into the form 
2coswot {1 - 
{®H= — . {5 + cos dwt+cos 2dwi+ +++ + cos "Se sut| [337] 
With the formula expressed by Eq. 11 this result may be written 
. ndw 
sin —— ¢ 
f@) = ———— - 0s aot [338] 
bw 


n sin ~t 
sin = 
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Here the function 


- Ndw 
sin a t 
F(t) = ———— [339] 
5 ad 
nsin— ? 
sin> 
is slowly variable, compared to the mean frequency component Cos wolf, 
and may be regarded as an envelope function enclosing this mean fre- 
quency. The beat phenomenon is placed in evidence by the envelope 
function just as it is in the simple case of two interfering components. 


38 ode 2m | 2m i 2m i am am 
Aw dw Aw Aw dy; Aw Aw 


Fic. 28. The interference pattern of the frequency group given in Fig. 27. 


This envelope function is plotted in Fig. 28 versus the time ¢ for the 
case n = 7. The interference pattern for the group of frequencies whose 
line spectrum is given in Fig. 27 is thus illustrated by Fig. 28. The regions 
of constructive interference, which lie in the vicinities of the main humps 
of the envelope, are spaced at intervals of 


2 
T= e seconds [340] 


The duration of each main hump is 47/Aw seconds. The group period 
or beat period 7, is, according to Eq. 340, inversely proportional to the 
frequency increment dw, and the duration of a region of constructive 
interference is inversely proportional to the width Aw of the group. If 
this width is kept constant while more components are added to the 
group, the regions of constructive interference occur at longer intervals 
but their duration remains the same. The number of smaller humps be- 
tween the large ones also increases; but, for a large number of component 
frequencies, the amplitudes of these smaller humps become insignificant 
midway between the large humps and in this vicinity. 

The fact that the interval between regions of constructive interference 
is inversely proportional to the frequency increment 6w leads to the con- 
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clusion that, as this increment is allowed to approach zero, the beat 
period grows without limit. The frequency group finally becomes con- 
tinuous, and the resulting function has only one region of constructive 
interference in the entire time scale from minus to plus infinity. This 
reasoning assumes, of course, that each frequency component in the 
group endures throughout the entire time scale; that is, it represents a 
true steady state component: 

The limiting form for the envelope function F(é), resulting from 
letting 5w approach zero, is readily evaluated. Since the width Aw of the 
group remains constant, 2 and 6w must vary inversely as the limit 
n— ©, dw» 0 is carried to completion. The trigonometric sine in the 
denominator of Eq.:339 may be replaced by its argument, so that the 
limiting form of this function is readily seen to be 


sin Be t 
1 —_ 
2 


F(t) = [341] 
Od 
z Zz 
This is the function discussed in Art 13 and plotted in Fig. 17, except 


that here 


1 = at [342] 
2 
The resultant group function 
sin = t 
“fOQ= COS wot [343] 
= , 


is now entirely transient in nature; that is, it never repeats, but has only 
one region of constructive interference. Beyond this region the compo- 
nents in the group (which are now infinite in number) interfere de- 
structively forever. 

Since the phase angles of all the components are chosen equal to zero, © 
the region of constructive interference lies at the time origin. If in the 
argument of each cosine term in Eq. 335 a phase angle —¢ is inserted 
which has the following frequency dependence 


$(w) = (w — wo)lo [344] 


the variable ¢ in the above formulas for the envelope function is replaced 
by (# — to). The region of constructive interference then occurs in the 
vicinity of the point t = fp. If the phase angles of the components in the 
group are given random values, the form of the. resulting interference 
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pattern becomes very difficult to determine and in general exhibits no 
well-defined region of constructive interference. 

The fact that a continuous group of frequency components gives rise 
to a resultant function having a transient nature suggests that if a 
similar limiting process is carried out with the Fourier series, a means 
will be obtained for analytically representing an arbitrary. nonperiodic 
function. This suggestion is followed in the following article. 


19. Tue FourrIER INTEGRAL 


The essence of the discussion in the preceding article may be sum- 
marized thus: Whereas the linear superposition of a group of discrete, 
uniformly spaced, frequency components (finite or infinite in number) 
gives rise to an interference pattern of a periodic nature, this pattern 
assumes a transient character when the frequencies in the group are 
continuously distributed. The resulting so-called continuous spectrum 
should be thought of as a line spectrum in which the spacing of the lines 
is allowed to approach zero, and the transient character of the resulting 
time function may be thought of as the limiting form of a periodic func- 
tion for which the period has become infinite. That these two limiting 
" processes are consistent is evident from the fact that the spacing of the 
lines in the spectrum of a periodic function equals its fundamental 
frequency, which is the inverse of its period. It may further be helpful in 
this connection to recognize that the period (which is the reciprocal of the 
spacing of the lines) is equal to the line density expressed as the number 
of lines per cycles per second. As the period is allowed to grow without 
limit, the line density grows without limit, so that finally the spectrum 
becomes a continuous one and the function never repeats; that is, it 
becomes a transient function. 

‘By the carrying out of this limiting process in terms of the Fourier 
series Eq. 175 and the relation 176 for its coefficients (spectrum function), 
an analytic means is obtained for the representation, in a closed form, 
of an arbitrary transient function. The value of such a mathematical tool 
in connection with various engineering problems is readily appreciated. 

Repeating, for convenience, the mathematical statement for the 
periodic case* 


f®= XT ane [345] 
x few: 2 
a, = [fe ine at [346] 
Lx — fay, 4 


' *The fundamental angular frequency is in the present article denoted by w in order that 
the symbol w may be used to denote any arbitrary angular frequency. g 
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one should observe that the limiting process indicated. by s— © or 
w, > 0 evidently is accompanied by a, — 0. That is, as the line spacing 
in the spectrum is allowed to become smaller and smaller, the amplitudes 
of the harmonic components also become smaller and smaller. However, 
it may be expected that the ratio a,/w, approaches a finite limiting 
function. Thus, before carrying out the limiting process, it is expedient 
to rewrite Eqs. 345 and 346 in the form 


0) = E(t) emma) [347] 
(2) = [poem dt [348] 


In Eq. 347 the symbol 6(w,) stands for the increment in the variable 
nw, Which is equal to the line spacing and hence equal to w}. 
The limiting process is now formally indicated by 


To 


0 ‘ 
nes a | (nos) 


5(nw) —> dw [349] 


(2) — g(w) 


Here it should be recognized that a new variable for the frequency, a 
continuous variable, must be introduced to take the place of the dis- 
continuous variable 2w,. This new variable (denoted by w) refers to any 
finite angular frequency in the continuous spectrum just as the discon- 
tinuous variable nw, does in the line spectrum. The introduction of the 
new symbol may be regarded as a convenient way of avoiding the ap- 
pearance of the quantities 7 and 1, which become improper in the limit. 
However, w = mw, remains a proper variable and still refers to the 
frequency of any harmonic component in the limit precisely as it does 
before this limit is carried to completion. 

If this limiting process is thought of as being carried out in steps 
through doubling and redoubling of the period 7, it becomes clear that 
each time 7 is doubled, any specific harmonic component doubles its 
order n. If attention is fixed upon a specific harmonic frequency, » and 
w, must vary inversely (that is, mw, must remain constant) as the period 
is increased. The frequency increment 6(mw;) in the limit is formally 
replaced by the differential dw, and the ratio ap/w, becomes a finite 
function g(w) of the continuous frequency variable w. This function 
expresses the variation of the harmonic amplitudes in the limit. 

It remains to recognize that as the limiting process is carried to com- 
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pletion, the summation in Eq. 347 becomes an integration. The final forms 
of the relations 347 and 348 are 


{®= f ‘ g(a) du e7*! [350] 


lo) = = [Fw ater (351] 


This heuristic derivation of the relations 350 and 351 does not establish 
their correctness on a rigorous mathematical basis, but from an en- 
gineering point of view a rigorous proof may properly be omitted since 
the principal interest lies in the interpretation and use of these forms. 

It may be pointed out once more that the complex harmonic ampli- 
tudes are not given by g(w) but by g(w) dw. Since g(w) is finite, and dw is 
the symbolic representation of a quantity which is regarded as becoming 
vanishingly small, the harmonic amplitudes are also vanishingly small. 
However, just as 5(mw,) denotes a constant spacing in the line spectrum 
of a periodic function, so dw, which represents 5(mw,) as the limiting 
process is carried to completion, must, at any stage of this process, 
likewise be regarded as a constant. Hence g(w) is proportional to g(w) da, 
so that a plot of g(w) versus w shows the correct variation of the harmonic 
amplitudes with frequency even though all these amplitudes are vanish- 
ingly small. 

Equation 350 is called the Fourier integral representation for the 
function f(é). The function g(w) is called the Fourier transform ot f(t) 
and, reciprocally, f(#) is called the inverse Fourier transform of g(a). 
The second of the pair of integrals 350 and 351 transforms a time func- 
tion f(¢) into its equivalent frequency function g(w), and the first of these 
integrals reverses the process. The second integral analyzes the time 
function into a spectrum, and the first integral synthesizes the spectrum 
to regain the time function. g(w) represents the function in the frequency 
domain just as f(#) represents the function in the time domain. One may 
also regard Eqs. 350 and 351 as representing simultaneously a pair of 
integral equations and their mutual solutions. 

The graphical interpretation of the entire process of Fourier analysis 
and synthesis in terms of the frequency spectrum as discussed in Art. 9, 
as well as the method of manipulation of the forms 350 and 351 in con- 
nection with various physical and mathematical problems, remains 
exactly the same as for the Fourier series. Hence there is nothing new 
to be learned in this respect. The essential difference lies only in the fact 
that the spectrum function is continuous and that the synthesis of the 
spectrum is accomplished by means of an integral instead of a sum. This 
latter difference is actually an advantage because more formulas are 
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available for the evaluation of integrals than for the evaluation of sums. 
The principal advantage, however, lies in the fact that the Fourier 
integral is capable of representing transient functions. 

The conditions under which this representation is possible are es- 
sentially the same as those which apply to the representation of a periodic 
function by means of a Fourier series. These are the Dirichlet conditions 
as pointed out in Art. 7. A detailed difference in the form of these condi- 
tions arises from the fact that the fundamental range is now infinite 
instead of being finite. In view of this difference, the condition 136 reads 


f 7 | f(é)| dt shall be finite [352] 


At a discontinuity of the function f(t) the Fourier integral (like the 
Fourier series) also yields the arithmetic mean between the two im- 
mediately adjacent values of the function (as stated for the Fourier 
series by the relation 137). 

The approximation properties of the Fourier integral are likewise the 
same as those of the series. In order to show this, one may consider the 
integral 350 for finite limits. Thus, the function 


s(t) = fi) du eft [353] 


is the analogue of the partial sum of a Fourier series, since it represents 
the synthesis of a finite portion of the spectrum. Substituting for g(w) 
the expression given by Eq. 351, and using @ in place of the variable #, 
one has after interchanging the order of the two integrations 


1 ad o. 
s(t) = a f (0) de f e JOD dy [354] 
Writing for the exponential 
eF#) = cos w(9 — t) —j sin w(6 — 2) [355] 
and observing that the cosine is even while the sine is odd, one finds that 
[iO do = 2 ff 00s 0(@ — 1) de [356] 
and hence that Eq. 354 may be written 
1 oo 
S(t) = MEO) do f° cos w(6 — t) dw [357] 
But 
sin a(@ — #) 
fs w(6 — t)dw = ne [358] 
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so that 


ir sin a( — ¢) 
)=- ff) ao 
== f 10 [359] 
which is analogous to the integral 245 for the partial sum of a Fourier 
series. 
In order to illustrate the application of this formula one may consider 
the function defined by 


e 


f(@) =0 for a<-3 
3 5 
f@) =1 for 5 <6< 5} [360] 
6 
f(@) =0 © -for a> 3 
Then the integral 359 yields 
= =a sin a(@ — #) 
s(é) = Sn Oe do [361] 
which may be written 
a(5/2—21) gj 
s@) => fr aw [362] 
wedJ—a(/2+t) & 
or 
4 fra/2—) gj —a(6 /2+#) gj 
sox 1 c2—)sinu 1 sin vo, [363] 
Jo x JO u 


Utilizing the definition of the Si-function according to Eq. 257, and 
observing that Si (—x) = —Si (x), one obtains the result 


s() = ‘si é ( 5 ;) ~ Sia (« S =k [364] 


Figure 29 shows a plot of this result for the choice a = 167/65. The 
separate Si-functions are also drawn (dotted) in order to illustrate more 
clearly how the resultant function is obtained. The similarity is very 
evident between this result and that shown in Fig. 21 for the partial sum 
of a Fourier series representing the periodic repetition of the present 
function f(6). In fact, the approximate evaluation of the Fourier sum 
s, (a) for large values of » given in Art. 13 in connection with the problem 
illustrated in Fig. 21 likewise leads to an expression in terms of the 


Si-function. 
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The characteristic 18 per cent overswing at the points of discontinuity, 
yielding in the limit a > the Gibbs phenomenon, is again in evidence, 
just as it is for the Fourier series. Equation 364 and the corresponding 
plot in Fig. 29 also illustrate the fact that the value of the Fourier integral 
at the points ¢ = +6/2 equals the arithmetic mean between the two 
values of the function immediately adjacent to these points of 
discontinuity. 
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Fic. 29. The Gibbs phenomenon and the arithmetic mean property are characteristic 
of the Fourier integral as well as of the series. 


20. ALTERNATIVE FORMS IN WHICH THE FOURIER INTEGRALS MAY 
BE WRITTEN 


The pair of mutually inverse integral relations 350 and 351 may be 
written in a variety of different forms, with some of which it is well to 
be acquainted. One of the astonishing features about these relations is 
their almost complete identity in form. Thus, the integral 351, except 
for an interchange of the variables, w and #, differs from the integral 350 
only in the appearance of the factor 1/2m and the reversed algebraic 
sign in the exponent of e. The first of these differences may be removed, 
if removal seems desirable, by redefinition of the transform of f(¢) as 


8*(w) = V Ing (w) [365] 
The relations 350 and 351 then assume the more symmetrical forms 
= 1 a a foot 
f@ a oe g (w) dw e? [366] 


1 7 Jet 
ai oe f ft) de [367] 
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Alternatively, the appearance of a factor before these integrals may 
be entirely avoided through considering the transform tq be a function 
of the cyclic frequency f (in cycles per second) instead of the angular 
frequency w = 2af. Inasmuch as dw = 27 df, it is readily seen that when 


a(f) = 2xg(w) . [368] 
the integrals 350 and 351 become respectively* 
we) =f af) af eet" [369] 
and 


a) = fa err [370] 


21. SPECIAL FORMS FOR THE FOURIER INTEGRALS WHEN THE GIVEN 
FUNCTION IS EVEN OR ODD 


If the Fourier integrals 350 and 351 are written in the more explicit form 


f{O= f g(w) {cos wf + 7 sin wt} dw [371] 
and 
gw) = = ff O feos wt — 7 sin wt} dt [372] 


then, if f(#) is an even function of ¢, the second term in the integrand of 
372 contributes nothing to the value of this integral, and if f(t) is an odd 
function of #, the first term contributes nothing. In the first of these cases, 
moreover, g(w) must be an even function of w since the cosine is an even 
function, and in: the second case, g(w) is an odd function because the 
variable w is then contained only in the argument of the sine. Hence it 
follows that if f(é) is even, g(w) is also even, and the Fourier integrals 
read 


fO= f Z &(w) cos wt dia [373] 


gw) = an XO) cos wt di [374] 


*A different symbol is used here to denote the time function in order to avoid confusion 
with the symbol f used for the cyclic frequency. 
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If f(é) is odd, g(w) is also odd, and the Fourier integrals have the form 
f) = J elo) sin ot de [375] 


oe = fl i@ snot at [376] 


In both these cases the integration may alternately be extended over 
the range 0 to © and the integrals multiplied by 2. This factor of 2 may, 
if desired, be absorbed by the function g(w) in the integrals 373 and 375, 
whence the factors for the integrals 374 and 376 become 1/z instead of 
1/2m. The factor 7 appearing in the forms 375 and 376 evidently need 
not appear if the transform of f(t) for this case is defined as jg(w) and 
is dénoted by a single symbol. 

These results show that if the given function f(¢) is real (although 
this is usually the case in practice, the validity of the Fourier integrals 
does not require it), g(w) is real when f(¢) is even, and purely imaginary 
when f(é) is odd. In general, for real functions f(¢), one may decompose 
this function into its even and odd components fi (¢) and f2(¢) respectively, 
according to the relations 80 and 81, and then have for the corresponding 
transform 


g(w) = gi(w) +fge(w) . [377] 
in which . 
Oe x [Ao de = x [CAO cosutds — [378] 
and 
° » 1 ° 
Poe ne [pO det = = [of sin otdt [379] 


22. Some ELEMENTARY PROPERTIES OF THE FOURIER TRANSFORMS 


In the Fourier integrals 350 and 351, if the algebraic signs of both 
variables wand ¢ are reversed (w replaced by —w and t by —?), the forms 
of these integrals remain unchanged except for the appearance of f(—?) 
in place of f(#) and g(—w) in place of g(w). To see this, one should note 
first that changing the signs of both and ¢ leaves the exponents of e 
unchanged; and, second, that the signs of dw and di do change but so 
do the signs of the infinite limits, and since this amounts to an inter- 
change of the limits, the resulting signs of the integrals are the same as 
before. Hence, one obtains the result: 

If g(w) is the transform of £(t), then 
g(—w) is the transform of {(—t) [380] 
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In the general case in which both f(t) and g(w) may be complex, it 
should be observed that both sides of Eqs. 350 and 351 assume their 
conjugate values if the functions f(#) and g(w) are replaced by their 
conjugate values f(t) and Z(w), and if, in addition, the algebraic sign of 
either w or ¢ (not of both) is changed. This change is equivalent to chang- 
ing the sign of 7 in the exponent of e. The statement, therefore, follows 
that: 


If g(w) is the transform of £(t), then 


&(=a) is the transform of #(+t) [381] 


As pointed out in Art. 20, the integral 350, which performs the inverse 
of the transformation given by the integral 351, is almost identical to 
the latter in form. If it were identical in form, one could omit entirely 
the distinction between the transform and the inverse transform of 
a given function, since two successive applications of the same trans- 
formation would yield the function itself. That is, the variables w and ¢ 
might be interchanged, or in other words f(t) and g(w) might be regarded 
as each other’s transform. 

Although the actual situation is not quite so simple as this, it may be 
observed that the forms of the two integrals do become interchanged if 
(a) Eq. 351 is multiplied by 2x, (6) the integral 350 is multiplied by 
1/2m and the function g(w) under this integral sign by 2x, and (c) the 
algebraic sign of either w or ¢ is reversed. One may then interchange the 
variables w and ¢ and regard g as the given function and f as its transform 
or equivalent frequency function. This conclusion is summarized by 
the statement: 


If g(w) is the transform of {(t), then 


f(-:w) is the transform of 2rg (=) [382] 


In particular, if f(¢) is even so that g(w) is even also, the process of 
changing the algebraic sign of either w or ¢ may be omitted. It is then 
true that if g(w) is the transform of f(#), one may reciprocally regard 
2ng as a given function of time and have f represent the corresponding 
frequency function. 

The statement 382 may be given an even simpler form if it is made in 
terms of the functions f(#) and g*(w) defined by the integrals 366 and 367, 
or in terms of the functions /(¢) and g(f) defined by the integrals 369 and 
370. The factor 24 then does not mar the almost complete reciprocity 
existing between the pair of functions involved in the statement. 

Some problems make it convenient or necessary to change the time 
scale for the given function f(é) by some factor; it is then desirable to 
know the effect of this change on the corresponding transform. In order 
to see the effect, one replaces the variable ¢ in the integral of Eq. 351 by 
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at. As a result, the differential dt becomes replaced by a times dt and the 
exponent of e becomes —jwat. If now one replaces w by w/a, there results 


7#(¢) 7 + fcr (at) dt e~7+* [383] 


a°\a 
Hence it follows that: 
If g(w) is the transform of £(t), then 


*e(2) is the transform of {(at) [384] 

This statement is true only if the factor a is positive. For example, 
for a = —1 the statement 384 is evidently incorrect because it conflicts 
with the statement 380. This conflict arises because the algebraic signs of 
the limits change when a is replaced by —a. 

Another useful result follows from the observation that if the function 
f(t) in the integral 351 is replaced by the product f(é).- e=/e0t, and the 
“two exponentials in the resulting integrand are combined into one 
exponential with the exponent —j(w =F wo)é, the effect upon the function 
g is merely to replace its independent variable w by (w + wo). Hence: 


If g(w) is the transform of f(t), then [385] 
g(w =F wo) is the transform of £(t) -e* 


The latter function is the complex form of a sinusoidal function of 
angular frequency wo whose amplitude is modulated by the function f(t). 
The corresponding spectrum function is seen to be the same as that for 
f(t) except for a translation equal to the value of wo. Utilizing this result 
together with the principle of linear superposition, one may readily obtain 
the spectrum functions corresponding to f(t) - cos wot or f(t) - sin wol. 

The complement to the statement 385 is obtained through assuming 
the function g(w) in the integral 350 to be replaced by g(w)e*#. Again 
combining the exponentials, one finds that: 


If g(w) is the transform of {(t), then 


g(w) - e+ Zs the transform of f(t + to) [386] 


This result was observed in connection with the Fourier series in Art. 4, 
namely, that a displacement of the time function merely amounts to 
adding increments to the harmonic phase angles which are linearly 
proportional to the respective harmonic frequencies. 

Other useful relationships between the transforms are found through 
observing that it is permissible to differentiate or integrate with respect 
to the parameter contained in the integrand of either of the integrals 
350 or 351 as long as the resulting functions involved in these expressions 
still fulfill the conditions for Fourier integral representation. For example, 


Art, 22) ELEMENTARY PROPERTIES OF FOURIER TRANSFORMS 527 
differentiating both sides of Eq. 350 with respect to ¢ yields 
a me f ” jeg) doe [387] 
Repeating the process times, one finds 
fd) = J Gayreo) deo oft [388] 


from which one may conclude that: 


If g(w) is the transform of {(t), then (jw)"g (w) 


is the transform of the nth derivative of f(t) [389] 


The function given by this mth derivative of f (#) must, of course, still 
fulfill the requirements for which its representation by means of the 
Fourier integral is valid. If the function given by the integral of f(4), 
namely, 


Fe) = {fo a [390] 
also fulfills these conditions, then since 
Al [soa] =F =10 [301] 


it may be inferred from the statement 389 that: 
If g(w) is the transform of {(t), then 


t [392] 
2(w)/jw is the transform of f £(@) de 
Under the same conditions, this statement may be extended to functions 
formed by successive integrations of f (t). 
Similarly, by differentiating or integrating Eq. 351 with respect to w, 
one finds that: 
If g(w) is the transform of f(t), then the nth 


derivative of g(w) is the transform of (—jt)"£ (t) [393] 


or: 
If g(w) is the transform of f(t), then 


ip “g(u) duis the transform of £(t)/—jt 


Again, the functions (—jt)"f(f) and f(é)/—jé must still fulfill the condi- 
tions for Fourier integral representation.* 


[394] 


*The conditions under which the statements 389, 392, 393, and 394 are valid are actually 
far less rigid than the Dirichlet conditions provided one interprets the results in the light 
of Art. 24. 
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23. THE TRANSFORM OF A PRODUCT AND THE INTERPRETATION OF 
POWER PRODUCTS AND EFFECTIVE VALUES FOR TRANSIENT 
' FUNCTIONS 


When the given function f(#) is expressed as a product of two 
component functions as 


fO =AOk® [395] 


one is interested to know how the transform may be expressed in terms 
of the individual transforms of the components f,(¢) and fo(t). 

It is helpful to observe that this problem is very similar to that of 
determining the expression for the coefficients in the resultant series 


Y= N92 = Ds OB" [396] 

in terms of the coefficients in the component series . 
"= x On” [397] 
and oo | 
ya Baa” [398] 


Forming the product of these two series, one has 


y= ZS ZS. aude an [399] 


mM=—o Nn=—o 
or, letting 
m+n=r [400] 
one may write this as. 
y= 2b ( pa anbim) 2" [401] 
in which the substitution of the summation index r for 7 is permissible 
since the summation over m is independent of that over ». A comparison 
of Eqs. 396 and 401 reveals that the desired relationship reads 


2 


a = Lambs [402] 
Similarly, if ad 
A) = fs) due [403] 
and 


fat) = ff” gar) dv [404] 
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are the Fourier integral representations for the component time functions 
in Eq. 395, one may write 


fildfald) = ff exudgato)er™ du dy [405] 


since w and v are completely independent variables. 
It is now possible to make the change of variable 


vew-—yu, d=dwa [406] 
because » is a constant parameter as far as the integration on » is con- 


cerned in Eq. 405. 
Equation 405 then becomes 


f0) =f dof extudealw — m) du oi [407] 
If this is written 
f0) =f ew) dee [408] 
the transform of f(#) is seen to be given by 
ee) =f evudeo(o — ») de [409] 
which is the desired relation. It is entirely analogous to the result ex- 
pressed by Eq. 402 for the corresponding problem in terms of series. 
Since there is no need for distinguishing between the functions f;(¢)- 
and f2(t) in this argument, it is evident that Eq. 409 remains true if the 


subscripts 1 and 2 are interchanged. 
In an exactly analogous fashion one finds that if 


1 « : 
g1(w) = a) do ei? [410] 
and 
1 ~ . 
ga(w) = 5 f fal) de [41] 
then the inverse transform of the function 


i ss jot 
g(o) = gi(w)ga(w) = oe f SO de [412] 
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is given by* 
FO = 4 [AO - 0) a [413] 


Here, also, the subscripts 1 and 2 may be interchanged. 
Expressing f(£) in this last result in terms of its Fourier integral, one has 


[i eed) do = 5 [" AOhe- odo [At] 


Setting ¢ = 0 and subsequently substituting the symbol ¢ for @ in the 
integral on the right of Eq. 414 yield the result 


[CAO dt = ae f" gile)galee)do [ANS] 


in which the statement 380 is also used. 

This relation is analogous to the result stated for the Fourier series 
by Eqs. 190 and 196, in Art. 10. In other words, the left-hand side of 
Eq. 415 may represent the integrated power product for some physical 
system, whence the right-hand side expresses this integrated power in 
terms of the transforms corresponding to the given time functions. The 
relation 415 is applicable to transient functions in the same way that 
Egs. 190 and 196 are to periodic functions. 

If, in particular, 


fo(=Ft) = fil-#) [416] 
then, according to the statement 381 
. g2 (Fe) = Zi (she) [417] 
or 
ga(-bw) = %1(=Fw) [418] 
The relation 415 for this special case reads 
f : | fr(e)|? dt = Qn f i |g (wo) |? deo [419] 


which for transient time functions expresses an analogous relation to 
that stated by Eqs. 200 and 201 for periodic functions. The square root 
of the value given by Eq. 419 may be called the effective value of the 
transient time function. According to the definition of the effective 
value of a periodic function, one forms the square root of the mean of 
the integrated squared values of that function over a period. The interpre- 


*The mathematical procedure described by this integral is called “ convolution ” (in the 
German literature the term used is “ faltung ”’). 
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tation of the relation 419 offers a complete parallelism between the 
periodic and the transient cases. 

Finally, it is possible to form another relationship which at times 
becomes practically useful. In Eq. 415, fo(=-¢) is any time function, and 
82(-tw) is its transform with the reversed algebraic signs of its. inde- 
pendent variable. If the time function is chosen to be 2xg2(=F4), then, 
according to the statement 382, its transform with reversed signs of its 
variable is fo(=-w). Since the corresponding signs of the variables ¢ and 
w are now alike, the + signs may be dropped, ‘and one has in place of 
Eq. 415, 


LE nOno a= [7 a ofele) do [420] 


In connection with this relation it should be carefully observed that 
the functions g, and gp are understood to be the Fourier transforms of 
the functions f; and f2, and the latter are the inverse transforms of 2; 
and go. The reason for this emphasis is that the reader may make the 
mistake of regarding f2(w) as the transform of go(¢) because the latter is 
written as a function of ¢ and the former as a function of w. This con- 
fusion may best be avoided through adopting an entirely different symbol 
for the independent variable, and using the same symbol for both inte- 
grations since the distinction between given function and transform is 
expressed respectively by the symbols f and g, and has nothing to do 
with the symbols used for the independent variable or variable of inte- 
gration. The preferable form of Eq. 420, therefore, reads 


f "f@)go() dx = f : 81 (%) f(a) dee [421] 


- 24, SoME ILLUSTRATIVE EXAMPLES; THE SINGULARITY FUNCTIONS 


The first function to be considered is defined by 
f@) = 9, fori <0 
f®=e™, fori>0 

This function is illustrated in Fig. 30. According to Eq. 351 its transform 
is given by the integral . 


g(o) = = f * getiodt dy [423] 


[422] 


which yields 
1 


——_—_~ 424 
Qr(a + jo) [424] 


g(w) = 
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This result may be written 


g(w) = g1(w) +Jee(w) = FTE ers [425] 


rey whence 
a 
£1(w) = ne? Tw) [426] 
—W 


Fic. 30. An imation t . A 
ihe nik sen Wee decane According to the relations 377, 378, and 


exponential. 379, these are the transforms of the even 

and odd components respectively of the 

function f(t). The odd components are shown plotted in parts (a) and 
(b) of Fig. 31. Both these components are given by le for ¢ > 0, 
but fi (é) is symmetrical about the vertical axis at the erigin, whereas 


fi(t) 
Fy t 
2 u/2 Ak 
t t 
0 0 ye 
(a) (b) 


2. 


Fie. 31. Even and odd components of the function of Fig, 30, 


fo(t) is antisymmetrical. The results given by Eqs. 426 and 427 may 
alternatively be obtained from the integrals 378 and 379 respectively. 
Since the integrands in these integrals are even functions, the same 
value is obtained if the integration is extended only over the range 0 
to © and the result multiplied by 2. Thus it must be true that 


1 ° at ie sd 
gilw) = 5. f e* cos wh di = > Ca? + F) [428] 
and 
Boe St wee [429] 
2x Jo 2a (a? + w) 


which, incidentally, is a simple way of evaluating these two particular 
integrals (the usual process requires two successive integrations by parts). 
These functions are shown plotted versus the ratio w/a in Fig. 32. 
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Hence the point unity, for example, corresponds to w = a, so that, as a 
is given smaller and smaller values, this point corresponds to smaller 
and smaller values of w. At the same time it should be observed that the 
ordinates of these curves are inversely proportional to a. As a is assumed 
to become smaller, the function gi(w) appears to become more peaked, 
until in the limit a —» 0 it degenerates into a single infinite ordinate at 
w = 0, and is zero everywhere else. 


Fic. 32. The transforms of the even and odd parts given in Fig. 31. 


It should be observed, however, that the area under the curve gi (w) 
is constant and independent of a, for it represents, according to the 
Fourier integral 350, the value of f;(¢) for t = 0, that is, 


fi) = f gil) do = 4 [430] 


As a becomes smaller this area becomes more and more concentrated in 
the immediate vicinity of the point w = 0, until in the limit a — 0 it is 
contained within a region of vanishing width at the origin. Since the area 
remains finite, the ordinate of gi(w) at w = 0 must clearly become 
infinite in this limit. It is quite significant that the function g,(w), there- 
fore, does not vanish for « = 0, as one might at first glance conclude by 
inspection of the function 426. 
From Eq. 427 it is, on the other hand, easily seen that 


limit [¢o()] = 5 [431] 
«0 Tw 


This is the equation of a rectangular hyperbola. The manner in which 
this limiting function is approached by g2(w) as a is assumed to become 
smaller and smaller is readily visualized from an inspection of Fig. 32. 

Turning now to the given function f(¢), one sees that in the limit 
a-—»0 this function is zero for ¢ < 0 and equal to the constant value 
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unity for ¢ > 0. In Fig. 30, the curve for ¢ > 0 then no longer falls off 
with increasing time but maintains the ordinate at ¢ = 0 for all positive 
values of ¢. The limiting forms of the even and odd components shown in 
Fig. 31 are also readily visualized. f,(#) reduces to the constant 4; fo(t) 
equals the constant value —}4 for ¢ < 0, equals the constant value +4 
for 4 > 0, -and retains the discontinuity of unit magnitude at ¢ = 0. 

_ This limiting function, defined by the relations 422 for a = 0, actually 
does not possess a Fourier integral representation, for it no longer fulfills 
the condition 352. As long as a has a nonzero value, however, no matter 
how small, the Fourier representation is possible, and hence, for a proper 
interpretation of the limiting forms of the functions g:(w) and go(w), 
such a representation may be said to be possible even in the limit a = 0. 

With the use of Eqs. 373 and 375. one has 


f@) = fe £1(w) COS wt dw ~ i g2(w) sin wt dw [432] 


According to the preceding discussion of the function g;(w), it is clear 
that as « becomes very small, the total contribution to the first of these 
two integrals is due almost entirely to the values of the integrand in the 
immediate vicinity of the point w = 0. If the variable ¢ is for the moment 
assumed to remain finite, then for a sufficiently small value of a, the 

_function cos w! remains equal to unity over that small range in the 
vicinity of w = 0 which contributes almost wholly to the value of this 
first integral. In the limit «0 this reasoning becomes exact, and in 
view of Eq. 430 one may, therefore, conclude that in or near this limit 
Eq. 432 is equivalent to 


{®O= : — f. Z2(w) sin wt dw [433] 


Substituting the limiting value of the function ge(w), as expressed by 
Eq. 431, one has 


1 © sin wt 
1 2 sin wt 


f® = 


‘which may be written 


fo=5 5+ limit | = f sain au} [435] 


—at OU 


[434] 


According to the discussion of the Si-function in Art. 13, and with the 
help of Eq. 257 the last result may be written, 


i) =5+ limit {2 Si cap} [436] 
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Hence it is clear that, except for the Gibbs phenomenon at the point of 
discontinuity, the Fourier integral representation for this function is 
valid even in the limit a = 0, provided the integrals are properly 
interpreted. 

It should be observed that one cannot simply set a equal to zero in 
the expression 424 for the transform of f(#), for then g(w) becomes iden- 
tical with jgo(w), and g;(w), which contributes the value 14 to the Fourier 
integral representation, is lost entirely. As long as a is retained as a 
small quantity, and discarded only after the proper interpretation of 
the various steps in the process of evaluation, no difficulties are en- 
countered. 

In order to illustrate this point from a slightly different angle, one may 
substitute the value 424 for g(w) into the Fourier integral 350 and have 


1 fe ev 
f@) == f waa je [437] 


With the sine and cosine equivalent of the exponential function, this is 


sO) = 5 


= COs wit ; ° sin wt 
eg Jt Beco 


— d 
ea + jw is QrJ—aa + jw ™ [438] 


The second of these two integrals remains proper in the limit a — 0, and 
hence one may set a = 0 in the second term of Eq. 438 without further 
ado. This term is then the same as the second term in Eq. 434, and hence 
it is evident that the first term in Eq. 438 is supposed to yield the value 14. 
The integral in this term, however, becomes improper in the limit a — 0 
because the integrand then becomes infinite for w = 0. If, nevertheless, 
a is set equal to zero, one observes that the integrand is an odd function 
of w, and since the limits of integration are symmetrical, the value of the 
integral, except for the difficulty in the vicinity of w = 0, should be zero. 
In other words, whatever value this integral may have must certainly 
be contributed by the immediate vicinity of the point » = 0. For this 
vicinity, which may be denoted by —p < w < p, one may again set 
cos wt equal to unity (since for very small increments from w = 0, cos wt 
differs from unity by a small quantity of the second order) and have 
for the first term of Eq. 438 


1 e dw 1 : i 1 | ati] 1 
2rd—-patjw nj Bae Ja) ~p aay Oe —Jjplaso 2 [439] 


which is the correct value. However, the importance of retaining a up to 
the last step during this evaluation process should be noted. Until then, 
a plays an important role toward guiding the evaluation and preventing 
misinterpretations; after this step has been taken, w has served its pur- 
pose and may be retired without causing further difficulties. 
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The integration in Eq. 439 may be understood clearly if it is regarded 
as an integration in the complex plane. Thus, when ; 


f=at+jo [440] 
this integral takes the form 
1 (atie de 
4 — 441 
2nj Jai) F [41] 


The sketch in Fig. 33 indicates the path of in- 
tegration in the complex {-plane. Inasmuch as the 
quantities (a + jp) and (a — jp) have the same 
magnitude, it is clear from the discussion of the 
logarithm function in Art. 17 of the preceding 
chapter that the value of the integral 441, the 
factor 1/217 being omitted, is a pure imagi- 
nary quantity, equal in magnitude to the differ- 
Fic. 33. The path of °° between the angle of (a + jp) and that of 
integration for the in- (a — jp). This net angle clearly approaches the 
tegral of Eq. 441 rele- value 7 as a is allowed to become zero, but it is 
vant to obtaining the to be noted that if a is set equal to zero to start 
unit step from its trans- with, the path of integration lies upon the imagi- 
form by the inverse ee! 

Mourier integral: nary axis in the ¢-plane, and the net angle might 

equally well be regarded as given by —z. With 

a retained, this ambiguity is avoided, and as soon as the correct interpre- 
tation has been seen, a may be discarded. 

The second example to be considered is the function defined by 


f@) = 9, for t< -3 
6 
f@=1, for — 3<t< ; [442] 


f(t) = 0, for . > ; 
This is the same function which is considered in Art. 19 in the discussion 
of the approximation properties of the Fourier integral, as illustrated by 
the plot of Fig. 29. The transform, according to Eq. 351, is given by 


: 6 
sl. w > 


Ww) = 7 feta 71 44 
laa —8/2 Or 6. [443] 
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The similarity of this result to that given by Eq. 187 for the complex 
Fourier coefficients in the series representation of the function illustrated 
in Fig. 12 should be noted. Inasmuch as the latter represents the periodic 
repetition of the function considered at present, this similarity between 
the expression for the Fourier coefficients on the one hand, and the 
Fourier transform on the other is, of course, not surprising. It may be well 
to point out here that the symbol w in Eq. 187 represents the fundamental 
angular frequency, and hence the quantity mw in that equation (not 
just w) becomes the analogue of w in Eq. 443. The latter may be obtained 
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Fic. 34. The rectangular pulse of unit height and its associated Fourier transform. 


from Eq. 187 through writing «; for a, and then applying the limiting 
process indicated in the expressions 349. With reference to Fig. 12, this 
process leaves the rectangular pulse at the origin but causes the adjacent 
pulses to move to infinity, thus yielding in the limit the function defined 
by the statement 442. 

The transform g(w) as a function of w evidently has the appearance of 
the dotted curve of Fig. 13, showing the form of a, as a function of nw. 
The important difference in the present example is the fact that the 
spectrum function g(w) is continuous; that is, all frequencies are present, 
not just integer multiples of a fundamental frequency. Figure 34 shows 
both the time function f(é) and the corresponding transform or spectrum 
function g(w). This spectrum is a continuous one, whereas the spectrum 
shown in Fig. 13 is a line spectrum. It should be remembered, of course, 
that the amplitudes of g(w) are not the harmonic amplitudes. The latter 
are all vanishingly small since their magnitudes are. symbolically indi- 
cated by the differential notation g(w) dw. The function g(w), neverthe- 
less, shows how these amplitudes vary with w for any increment dw 
however small. This is true because, at any stage in the limiting process 
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indicated by the expressions 349, the differential spacing dw of the 
“lines ” in the continuous spectrum is constant, just as the finite spacing 
«, (w in Eq. 187) is in the corresponding periodic case. 

In view of the statement 382 it is useful to observe that if the scale of 
ordinates in the plot of g(w) in Fig. 34 is multiplied by 27, the variables . 
w and ¢ for these plots may be interchanged. That is, 2ag(#) may be 
regarded as the given time function and f(w) as its transform. There is no 
need to reverse the algebraic sign of one of the interchanged variables 
w or ¢ in this example, because the functions are even. Thus, a time 
function like the curve for g(w) in Fig. 34 is seen to have a spectrum like 
the curve for f(t). This spectrum is, of course, also a continuous one, but 
the interesting feature about it is its finite extent. 
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Frc. 35. A modulated cosine wave and its associated transform. 


This result may be predicted on the basis of the discussion in Art. 18 
regarding the resultant interference pattern of a frequency group. It is 
pointed out there that the frequency group of Fig. 27 has an envelope 
function like that plotted in Fig. 28 and expressed analytically by Eq. 
339. As more and more lines are added to this frequency group, until it 
becomes a continuous spectrum of finite width like the function f(¢) in 
Fig. 34, the envelope function of the corresponding interference pattern 
approaches the form given by Eq. 341, which when plotted has the 
appearance of g(w) in Fig. 34. In other words, the function f(¢) in Fig. 34, 
regarded as a spectrum function, is a continuous frequency group whose 
resultant interference pattern (time function) has the appearance of the 
function g(w) in Fig. 34. 

By means of the statement 385 it is a simple matter to determine the 
spectrum function which results when the time function of Fig. 34 is used 
to modulate the amplitude of a sinusoidal function of arbitrary frequency. 
Figure 35 illustrates such a time function, namely, a cosine function 
enclosed by a rectangular pulse. The same figure also shows the corre- 
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sponding transform, which consists of the linear superposition of the 
functions g(w — wo) and g(w + wo). These are obtained through multiply- 
ing the function g(w) of Fig. 34 by 14 and displacing it respectively to the 
right and to the left of the origin by the amount wo, which is the angular 
frequency of the time function within the interval —8/2 < t < 6/2. 

Tt is interesting to consider the way in which the spectrum function 
for this example changes as the duration 4 of the sinusoidal time function 
is increased. One should observe that the functions. g(w — wo) and 
g(w + wo) consist essentially of a large hump whose amplitude is propor- 
tional to 6 and whose width at the base of this large hump is inversely 
proportional to 6. As 6 becomes very large, the spectrum function of 
Fig. 35 assumes the form of two very tall slim humps, one at wo and the 
other at —wo. In the limit 6 > © the spectrum function is given by two 
lines at the points -two, as one should expect from the fact that the single 
frequency wo alone then characterizes the resulting time function. 

A type of function very useful practically is readily derived from the 
function f(é) in Fig. 34. If the amplitude of this function is set equal to 
1/6 instead of unity, the area enclosed by the rectangle equals unity no 
matter what 6 may be. For a large value of 5, the rectangle is long and low, 
for a small value of 4, it becomes thin and tall. The transform g(w) 
corresponding to this f(¢) is given by the expression 443 divided by 4, 
that is, by 


ee 
sin w= 
1 2 
= — 444 
g(w) 5s ; [444] 
mo 


If 6 is now allowed to approach zero, f(é) degenerates into a single 
infinite ordinate at ¢ = 0; that is, the rectangle has zero width and an 
infinite height but still encloses unit area. This limiting form of the 
function f(t) of Fig. 34 is called a unit impulse and may be denoted by 
uo(t). Its transform is given by the expression 444 for the limit 6 — 0. 
With this limiting value of the transform denoted by v(w), it is readily 
seen that 


v(w) _ ~ ; [445] 


In other words, the transform of the unit impulse is a constant. 

In view of the preceding discussion of the function gi(w) of Fig. 32 
for the limiting process a — 0, this result may be arrived at in a different 
manner. The inverse transform of g,(w), namely f,(¢) of Fig. 31, becomes 
equal to the constant }4 for a—0; g1 (w) degenerates into a single 
infinite ordinate at w = 0, although the area enclosed by the curve for 
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gi(w) remains constant and equal to 14. According to the statement 382, . 
the transform of 27g; (#) in the limit a — 0 is the constant 14. Hence the 
transform of 2g;(é) for a 0, which is an impulse enclosing unit area, 
is the constant 1/27, the same as v(w). In other words, the function 
2g; (t) for the limit a — 0 may be identified with the unit impulse u(#), 
and its transform may be identified with v(w) in Eq. 445. It is rather 
interesting that this conclusion should be true, in view of the fact that 
u(t), in the argument of the preceding paragraph, is approached by the 
rectangular pulse f(¢) in Fig. 34, and the limiting form of 2g,(¢) is ap- _ 
proached by the rounded pulse of Fig. 32. 

The fact that the transform of the unit impulse is constant and equal 
to 1/27 may also be seen graphically. With reference to Fig. 34, if f(t) 
is multiplied by 1/6, then g(w) is multiplied by 1/5 also. Its amplitude 
is then independent of 6. As 6 becomes smaller, the distance from the 
origin, » = 0, to the points w = +27/8 becomes larger. For a very small 
6 (tall narrow rectangular pulse) this distance is so large that, over a wide 
range of frequencies, g(w) drops off very little from its value of 1/2m at 
w = Q. Finally, as 6 approaches zero, the points # = +22/5 move to 
infinity, and the curve for g(w) becomes a horizontal line 1/27 units above 
the w-axis. 

Conversely, one may let g(w) in Fig. 34 approach the unit impulse. 
The area under this curve already equals unity because it represents the 
value of f(0), according to the Fourier integral 350. Making use of the 
statement 382 again by saying that (1/27)f(w) is the transform of g(¢), 
and this time letting 6 approach infinity, one finds that g(é) approaches 
uo(t), and (1/2mr)f(w) approaches the constant 1/27. 

The next function to be considered is shown in Fig. 36. This is the 
integral form — to ¢ of the function f(¢) in Fig. 34 multiplied by 1/6. 
Since the transform of the latter is given by Eq. 444, it follows from the 
statement 392 that the transform of the present time function is given by 


, 6 
sm o> 
1 2 
go) = |] [446] 
"5 


For small values of w this transform behaves like the g(w) of Eq. 424 
for a = 0, that is, like the transform of the time function defined by the 
relations 422 for a = 0. In other words, the transform 446 for the function 
illustrated in Fig. 36 has a nonintegrable infinity at w = 0. This is due 
to the fact that the present time function does not fulfill the condition 
352, and the same difficulty occurs as discussed in connection with the 
function defined by the relations 422 for a = 0. Since the method of 
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dealing with this difficulty is now understood, however, the transform 
given by Eq. 446 may be accepted as an integrable function. 

As 8 is now allowed to approach zero, the function of Fig. 36 assumes 
the form shown in Fig. 37. This form, however, is the same as that of the 
function defined by the relations 422 and illustrated in Fig. 30 for the 
limit a — 0, as is also evident from the fact that the transform 446 for 
§ — 0 becomes identical with the transform 424 for a — 0. As indicated 
in Fig. 37, this limiting form of the function of Fig. 36 is denoted by the 
symbol w_; (t). 
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Fic. 36. The integral of a rec- Fic. 37. The function of 
tangular pulse similar to that of Fig. 36 where 6 is allowed 
Fig. 34. to approach zero. 


Since the function of Fig. 36 is the integral from — to # of the 
1/s-multiplied time function of Fig. 34 for any value of 5 however small, 
one may regard the function of Fig. 37 as representing the integral of the 
unit impulse %o(¢). Symbolically this fact is expressed by 


“ee f _uolt) di [447] 


in which the function u_,(#) is called the unit step function or, more 
briefly, the unit step. It is defined by the relations 


u(t) = 0, for t<0 
ust) = 1, for i>0 


and hence is identical with the function defined by the relations 422 for 
a = 0. It plays an important part in the Heaviside Operational Calculus, 
but in the more recent expositions of this subject the unit impulse func- 
tion uo (t) is found to be of greater value, chiefly because its transform is a 
constant. 

From these discussions it becomes clear that the unit impulse may 
alternatively be expressed as the time derivative of the unit step, that is, 


[448] 


u(t) = es [449] 
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This is in agreement with the statement 389 since the transform for the 
unit step is given by Eq. 424 for the limit a — 0, and this result multiplied 
by jw reduces to 1/27. 

From a conservative mathematical point of view the differentiation 
of a function having a discontinuity is considered not permissible and is 
regarded as having no meaning. In view of the present discussion, how- 
ever, it is clear that such operations are permissible provided they are 
properly interpreted. Thus it is possible to define functions and corre- 
sponding transforms for successive derivatives of the unit impulse. The 
first derivative is written 


n(¢) = [450] 


and its transform, according to the statement 389, is 


jov(w) = = [451] 

> 5 The function 1“ (é) is called the unit doublet. The 

oe reason for this name is clarified by reference to 
ye? Fig. 38 which illustrates graphically the manner 


t in which the derivation of this function is to be 
interpreted. Starting with the function shown 
in this figure, one obtains the unit doublet 2; (¢) 

ask _by passing to the limit 5 — 0. Since this limiting . 

process may be indicated symbolically through 

replacing 6 by the differential time increment df, 
the correctness of this graphical interpretation 
may be seen analytically from the fact that 


1s? 


Fic. 38. A rectangular 
. pulse doublet. 


dug _ u(t + dt) — uo (Et) 
di dt [482] 
or 
dh to (# + *) ~ uo (1 — *) 
— = [453] 
dt di 


When the statement 386 is applied, the corresponding transform is seen 
to be 
v(w)eF4t!2) — y(w)e FH! — v(w)jw dé 


a ao jod(w) [454] 


which agrees with the discussion surrounding Eqs. 450 and 451. 
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The unit doublet is evidently equivalent to a pair of equal but opposite 
impulses which are immediately adjacent to each other at the origin. 
The net effect may be likened to that of a couple in mechanics, and for 
this reason the term couple is sometimes used in place of the term doublet 
to designate the function 1; (#). 

It should be observed that the two impulses involved in this interpre- 
tation are not unit impulses. As shown in Fig. 38, the area enclosed by 
each rectangular pulse has the value 1/5. In the limit 6 — 0 this area 
becomes infinite, and the resulting impulse is seen to be one of infinite 
value rather than one of unit value. These considerations are pertinent 
to the proper interpretation of the expressions 452 and 453 inasmuch as 
uo/di symbolically represents an impulse of infi- 
nite value. >| 8 kd 6 

Through continuing in the same way a se 
quence of functions may be formed. The next in 1/8 


order is defined as t 
du, duo 
| —— ee —_ eel 5 
a) =", = GP [455] 2/8° 
and its transform, according to the statement 389, 
is given by 

y)2 Fic. 39. A rectangular 

Goy'n(o) = ZX [456] “pulse triplet 


The function «.(#) may be interpreted graphically as the limit of 
the time function shown in Fig. 39 as 6 is allowed to approach zero. It is 
equivalent to two equal but opposite doublets of infinite value centered 
about the origin and separated by the increment 6 = dt. Thus one may 


write ; 
( 4 5) ‘i (: *) 
uy >] “4 Spy 
du; 2 2 2 


and, making use of the statement 386 again, have for the corresponding 
transform 
jor (w) eft! — juv(w)e Fort!) 
JOON SN 
at 
It is also possible to extend this sequence of functions in the opposite 
direction by successively integrating uo (é). , 
The first integration yields the unit function 1 (t) or unit step. The 


next integration yields a time function which is linearly proportional to 
the time, like the current through a pure inductance when a constant 


= (jw)?0(w) [458] 


544 FOURIER SERIES AND INTEGRALS (Ch. VII 


voltage is applied. In connection with practical problems there is little 
actual use for any of the functions in this sequence except the unit 
impulse and the unit step, although a recognition of the availability 
and the interpretation of the general sequence of functions together with 
their transforms proves to be a useful tool in the application of Fourier 
integral analysis to various practical problems. 

According to the preceding discussion it should be clear that the 
transform of the general function u,(¢) in this sequence is (jw)"/2x. The 
sequence is referred to as the singularity functions, the unit impulse and 
the unit function being singularity functions of the order zero and minus 
one respectively. It should be observed that the singularity functions 
multiplied by 2x are the inverse transforms of the integer powers of (jw). 
Inasmuch as the inverse transform of (jw)” cannot be found in the usual 
fashion because the integral 350 becomes improper, the discussion in 
this article may essentially be regarded as an interpretation process 
which avoids this difficulty and demonstrates the existence of such 
integrals under suitable limiting conditions. 


25. THE ERROR FUNCTION AND THE SEQUENCE OF SINGULARITY 
FUNCTIONS BASED UPON .-IT 


It has been seen from the preceding article that the unit impulse and 
its transform may be obtained through applying to other suitably chosen 
functions, besides the rectangular pulse, a limiting process by which 
they degenerate into a single infinite ordinate enclosing unit area. Corre- 
spondingly, the entire sequence of singularity functions and their trans- 
forms may be derived through applying suitable limiting processes to 
a variety of properly chosen functions. One of the most interesting of 
these is the so-called error function which has the form 


fQ=e" [459] 


_ According to the integral 375, the transform of the error function is 
given by 


: ak cd —at? 
go) == f cos wt dt [460] 
The integration yields 
in [461] 


Here it is, incidentally, interesting to observe that for the choice a = 4, 
one has 


f@) = #2 [462] 
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and 
el 2 


V U8 


In other words, the time function and its transform are identical except 
for a scale factor. In particular, if the transform is defined as g*(w), 
according to Eq. 365 and the Fourier integrals 366 and 367, the scale 
factor becomes unity and the time function is identical with its trans- 
form. This situation may also be achieved by using the integrals 369 
and 370, for which the transform (considered as a function of the cyclic 
frequency f = w/2n) is defined in terms of g(w) by Eq. 368. Thus, with 
a in Eqs. 459 and 461 equal to 7, and Eq. 368 being used, the time func- 
tion reads 


g(w) = [463] 


h(t) = e-*® [464] 
and its transform becomes 
af) =e? [465] 
fw 


Fic. 40. The error function used in consideration of the singularity functions. 


Returning now to the Eqs. 459 and 461, letting @ = «/a and multiply- 


ing the resulting time function and its transform by the factor 1/Va, 
one has for the time function 


en tela 
[® = pe [466] 
and _for its transform 
eaetltea 
go) = —> [467] 


This time function is plotted in Fig. 40. Since the area under the time 
function equals 27g(0), it is clear that this area equals unity independent 
of the value of the parameter a. From Fig. 40 it is seen that as a becomes 
smaller the curve for f(¢) becomes taller and narrower and, in the limit 
a — 0, has the character of the unit impulse (¢). At the same time one 
recognizes, from the form of Eq. 467, that g(w) in the limit a + 0 becomes 
equal to the constant value 1 /2x, and hence identical with the transform 
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v(w) of the unit impulse. Thus the first of the sequence of singularity 
functions is obtained from f(#) of Eq. 466 for the limit a — 0, and its 
transform is found from Eq. 467. 

The remainder of the singularity functions are obtained from the 
successive derivatives of the function f(/) of Eq. 466 for the limit a => 0, 
and the corresponding transforms of this sequence of functions follow 


Fic, 41. The first derivative of the error function used in consideration of the unit 
doublet. . 


from the use of the statements. 389 and 392 in connection with the 
transform 467. For example, the first and second derivatives of Eq. 466 
read 


af 2rt 
igi tae te [468] 
and 
ad? Qa (2rt? — a) _ 
ie ae [469] 
whereas the corresponding transforms are given respectively by 
8 (w) = jog(w) = = eat [470] 
and 
” * -\2 (jw)? w2/4 ; 
& (w) = (jo)*g(w) = er ia [471] 


These time functions are shown in Figs. 41 and 42, from which it is 
clear that for a> 0 they have the general character of the singularity 
functions «(¢) and w2(¢). In other words, the function 468 approaches 
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the unit doublet as a approaches zero, and the function 469 approaches 
the singularity function of order two. 
' It is interesting to compare the time functions of Figs. 34, 38, and 39, 
respectively, with those shown in Figs. 40, 41, and 42, and observe that, 
although the-one set of curves is rectangular and the other set is smooth, 
both sets approach the same sequence of singularity functions when 
suitable limiting processes are carried out. This is evident from the fact 
that the transforms 467, 470, and 471 become identical with v(w), 
jwv(w), and (jw)?v(w) in the limit « > 0. 


Fic. 42. The second derivative of the error function used in consideration of the 
singularity function of order two. 


_Since the error function and all its successive derivatives are smooth, 
use of them in the derivation of the singularity functions does not in- 
volve the mathematically doubtful steps encountered when the rectangu- 
lar pulse and its derivatives are used for this purpose. 


26. RELATION TO CONTOUR INTEGRALS 


In applying the Fourier method of analysis to practical problems, one 
frequently encounters functions g(w) which are quite complicated, and 
for which the evaluation of the synthesis integral 350 presents some 
difficulty. Although the function g(w) may be complex, it should be clear 
that the integration is carried out with respect to the real variable w, 
and hence is essentially an integration of a function of a real variable. 
Inspection of the integral representation of g(w) according to Eq. 351, 
however, discloses that the variable w occurs only in the exponent of 
the exponential function and is there associated with the operator 7. 
This fact shows that the transform g(w) may alternatively be regarded as 
a function of jw and written in the form g(jw). If this is done, and the 
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synthesis integral 350 is rewritten in the modified form, 
Ny eee . 
FO) = 5 Jf, (de) du) [472] 


it appears that this process of modification may be carried a step further 
through introducing the formal change of variable 


A = jw [473] 
writing Eq. 472 in the form** 


Hi) = 5 f"" go) ano [474] 


and, regarding g(A) as a function of a complex variable, 
A=otjw [475] 


evaluating the integral 474 by the method of complex integration dis- 
cussed in Art. 15 of the previous chapter. This method of integration, 
which is thus made available for the evaluation of the synthesis integral, 
holds promise for the simplification of many problems which present 
almost insurmountable difficulties unless one is exceptionally skilled in 
the art of real integration. 

Before this method of dealing with the synthesis integral may be 
utilized, however, it is necessary to clarify several significant points 
which, in the above formal steps, are left in a somewhat doubtful state. 
First it is necessary to assure oneself that the complex transform g(A), 
which is obtained from the ordinary Fourier transform by the simple 
expedient of replacing 7w by a complex variable X, is in fact the analytic 
continuation of the function g(jw) into the complex domain. Such a 
justification is called for because the Fourier integral 351 defines the 
complex function g only in terms of the real variable w, or one may say 
that it defines the function g(A) only for values of \ on the imaginary 
axis of the A-plane. In other words, the Fourier integral 351 does not 
establish the existence of the function g(A) for all points in the -plane. 
In fact if one writes the integral 351 in the form 


eo) =5- [sO dte™ [476] 


one immediately recognizes that this integral converges for ¢ < 0 only 
for points in the left half of the A-plane and for ¢ > 0 only for points in 
the right. half of this plane. 


*Rewritten in terms of the variable A, the Fourier integrals are commonly referred to as 
Laplace’s integrals and g(A) as the Laplace transform of f(£). 
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The existence of g() for all complex values of \ is, however, readily 
established with the help of the principle of analytic continuation and the 
uniqueness theorem for analytic functions (see Art. 11 of Ch. VI). Thus, 
suppose that a function g(jw) is given (that is, the function g() for 
= jw) and let the problem be to find its analytic continuation into the 
\-plane. Suppose furthermore that somehow a function F(A) of the com- 
plex variable \ is found which is identical with g(A) for all points on the 
imaginary axis (actually it is only necessary that F(A) and g(A) be 
identical for all points on an arbitrarily small portion of the imaginary 
axis or for an infinite number of discrete points having a limit point on 
the imaginary axis). Then, according to the uniqueness theorem, the 
functions F(A) and g(A) are identical everywhere in the A-plane, and 
hence F(A) or g(d) is the desired analytic continuation of g(jw). 

Returning now to the integral 474, one finds that interpretation of it in 
terms of the method of complex integration raises a second pertinent 
question. The method of complex integration requires that the path of 
integration be in the form of a closed contour. According to the integral 
474, the path is the entire imaginary axis, from minus infinity, through the 
origin, to plus infinity. Since infinity is regarded as a single point (this 
view is most easily appreciated through considering the complex plane 
replaced by its associated complex sphere), one may say that the closed 
contour requirement is met by the integral 474. A difficulty arises, how- 
ever, because the integrand has an essential singularity at the point at 
infinity, since it contains the factor e’’. In the immediate vicinity of such a 
singularity a function is capable of assuming any assigned values (see 
Art. 13 of Ch. VI), and hence the method of passing through such a 
point cannot easily be disposed of. Unless the path of integration in the 
integral 474 is closed by passage through or around the point at infinity, 
the methods of complex integration cannot be applied, and yet the proc- 
ess of supplying this gap in the path of integration must be accomplished 
in a way which does not affect the value of the integral. 

For the following argument it is necessary to assume that g(A) is a 
rational function and that, for large values of \, it vanishes at least as 
1/d. Modifications in the procedure which are called for when g(d) does 
not fulfill these conditions are more appropriately considered later. In 
the immediate vicinity of the point at infinity the integrand is then 
essentially represented by the factor e/d. If the -plane is, for the 
moment, regarded as replaced by its associated complex sphere, ac- 
cording to the method of stereographic projection, one now contemplates 
by-passing the point at infinity by means of a path increment in the form 
of a small semicircular detour concentric with this point. In the ordinary 
\-plane this detour corresponds to a semicircular path of very large 
radius with the origin as a center, as indicated in Fig. 43. This large 
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semicircle lies in the right or left half of the \-plane according to whether 
the point at infinity is by-passed on the right or on the left. 

In order that the detour shall contribute a negligible increment to 
the value of the integral, it is clear that e’/) must at all events remain 
bounded for points on the detour. For such 
points it is also clear that A has large values 
and that these values become infinite as the 
radius of the semicircular path about the point 
at infinity is made smaller and smaller, that is, 
as the radius R of the corresponding path in the 
\-plane of Fig. 43 becomes larger. and larger. 
In order for e/A to remain bounded, it is 
seen, therefore, that, for ¢ <0, the detour 

must lie in the right half plane where the 
Fic. 43. The paths for eal part of 2 is positive and that, for ¢ > 0, 
replacing the Fourier in- ge 
tegral along the imaginary the detour must lie in the left half plane where 
axis by a contour integral. the real part of d is negative. 

It remains to show that, with this choice 
of detours for ¢ < 0 and ¢ > 0 respectively, the contributions to the 
integral 474 due to these added path increments are negligibly small. 
That is, one must show that the integral 


ent 
cam f <a [477] 


extended over the semicircular paths as indicated in Fig. 43 has a negli- 
gible value for a sufficiently large value of R. For points on the semicircle 


» = Re® = R (cos @ + / sin @) [478] 
and 
o = jd : [479] 


so that the integral 477 becomes . 
I =j [me : ekt cos? Jag [480] 


Since the factor e#*'*® has unit magnitude, it is clear that the value of 
the integral 480 extended over either of the two semicircular paths is 
certainly less than it would be if this factor in the integrand were omitted. 
As far as the magnitude of the integral is concerned, one may also drop 
the factor 7 and have 


Z| < f eft coe? dg [481] 
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According to the choice of paths for ¢ < 0 and ¢ > 0, it is observed that 
for t < 0 the limits of integration are from @ = /2to@ = —2x/2, and that 
for £ > 0 they are from @ = 4/2 to @ = 32/2. For either path, the expo- 
nent Ré cos @ is negative, and since |cos 6| < 1, one may state that for 
i <0, 


—x/2 /2 9 (1 = e~Fitl) 
2.008 8 —=Rtl6 _ FNS St 


and for ¢ > 0, 


x /2 /2 9 (1 a: e~Fltl) 
tcoB 8 — <Flte a 
ll < S. emt dy 0 fem ZR do Ra [483] 


For any nonzero |é|, the value of || may, therefore, be made arbitrarily 
small through choosing a sufficiently large value of R. 

According to the theory of contour integration, one recognizes that 
the radius R need be chosen only large enough so that the semicircular 
paths in Fig. 43 together enclose all the poles of the integrand. Since 
eis an entire function, these poles are those of the rational function g(a). 

As a simple illustrative example, let the time function f(¢) be the unit 
step u_1(¢) with the transform g(A) = 1/2ad. The synthesis integral 
then reads 
j 20 et 


1 
u_r(t) = Inj 
In evaluating this contour integral one must again be reminded of the 
fact (discussed in Art. 24) that the transform of the unit step function 
is to be regarded as the limit of the function 


1 
2n(a + 2d) 


for a > 0. The pole of this function lies at the point \ = —a, and since 
« is an arbitrarily small but nevertheless nonzero quantity, one observes 
that the pole of the integrand in the integral 484 must be regarded as 
lying, not at the origin of the d-plane, but slightly to the left of this 
point. The closed contour for ¢ <0, therefore, does not enclose this 
pole, but. the one for ¢ > 0 does. Inasmuch as the residue of the integrand 
in this pole is unity (see Art. 15, Ch. VI, for the evaluation of residues), 
one readily recognizes that the integral 484 correctly represents the 
unit step function. 

The necessity of writing the transform for the unit step in the form 
given by Eq. 485 and considering the limiting process indicated by 
a — 0 may be avoided if the path of integration in the vicinity of the 
origin is regarded as modified as shown in part (a) of Fig. 44. Instead of 


gQ) = [485] 
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passing through the origin, the path along the imaginary axis avoids the 
origin by passing to the right of it along a semicircular detour of vanish- 
ingly small radius. That a detour of this sort 
is necessary follows from inspection of the 
integral 484 inasmuch as a pole of the inte- 
- grand lies upon the path of integration. How- 
ever, unless one is aware of the limiting proc- 
ess which, in the limit, causes this pole to 
be located at the origin, one cannot know 
whether to by-pass this pole on the right or 
on the left. A knowledge of the limiting proc- 
ess which is necessary for the proper inter- 
(a) pretation of the Fourier transform for the 
Fic. 44. Modification of the unit step function is thus seen to be neces- 
path of integration in the SaTY also for the removal of ambiguity in 
vicinity of poles of g(a). the reverse process of regaining this time 
function from its transform. 
It is possible for the integrand in the contour integral 474 to have 
several poles on the imaginary axis. A case of this kind arises when the 
time function has the form 


f(t) = u_r(é) - cos (wot + ¢) [486] 


which represents a steady sinusoid starting at ¢ = 0. With the trigo- 
nometric function replaced by its exponential equivalent, the function 
486 may be written 


FD) = walt) - Flector’ + ee Font} [487] 


Utilizing the statement 385, one finds for the corresponding transform 


ei? 1 ee 1 
SOV eg) de kee 


[488] 


in which 
No = Joo [489] 


The integrand in the contour integral for this g(\)-function evidently 
has poles at the points \ = -jwo on the imaginary axis. In the evalua- 
tion of this contour integral, these poles must evidently be by-passed in 
the manner indicated in part (b) of Fig. 44. The residues of the integrand 
in these poles are seen to be respectively 


gis 
4a ; Ar 


= g het [490] 


whence, observing again the selection of proper contours for ¢ < 0 and 


* 
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i > 0, one sees that the evaluation of the contour integral in this case 
correctly yields the time function 486 or 487. 

In each of the two examples just discussed, one observes that g()) 
does fulfill the condition of vanishing at least as strongly as 1/) for large 
values of X. It is appropriate at this point to consider in greater detail the 
necessity for this condition. One should here recall the discussion in 
Art. 8 relative to the character of Fourier coefficients for periodic func- 
tions which either are discontinuous themselves or possess discontinuities 
in their derivatives of the first or higher order. It is pointed out there 
that if the time function is discontinuous, the Fourier coefficients can 
become smaller no faster than 1/vw, in which » is the order of the harmonic 
coefficient and w is the fundamental angular frequency. If the function’ 
is continuous but its first derivative is discontinuous, the coefficients can 
become smaller no faster than 1/»?w?, and so forth. 

Since these statements must obviously remain true as the period of the 
periodic function is made larger and larger, they apply also to the Fourier 
integral representation of a transient time function and its transform. 
The singularity functions u,(é), whose transforms are 1/2m\” are ap- 
propriate examples of this property. Inasmuch as the converse of these 
statements is evidently also true, one observes that the restriction that 
g(d) shall vanish for large at least as fast as 1/ is seen to imply that the 
corresponding time function shall possess nothing worse than discon- 
tinuities. In other words, the method of contour integration is applicable 
to the evaluation of the synthesis integral only if the corresponding time 
function contains terms involving singularity functions of the order —1 
or less. The impulse, for example, cannot be regained from its transform 
by the method of contour integration unless one employs special devices 
involving limiting processes similar to those used in the derivation of the 
transforms of such higher order singularity functions. 

From a practical point of view this restriction on the function g(A) 
is hardly serious since the behavior of a physical system never exhibits 
the properties of an impulse or its derivatives unless the data are de- 
liberately idealized. Moreover, one can, in such idealized cases, always 
apply a simple artifice to overcome the difficulty imposed by this re- 
striction. For example, suppose g(X) approaches a constant value for 
large values of \, implying that the corresponding time function contains 
an impulse. If the integrand is arbitrarily multiplied by 1/), the state- 
ment 392 shows that the corresponding time function is replaced by its 
integral. Contour integration may then be applied, and the desired time 
function found through differentiating the result. In general, one may 
multiply the function g(\) by whatever power of 1/\ is needed to obtain 
the proper behavior for large values of \ and, after evaluating the contour 
integral, differentiate the result a corresponding number of times. 
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During this subsequent process of differentiation, one must observe 

the following precautions. If the time function resulting from the contour 
integration contains discontinuities, its derivative contains a corre- 
sponding number of impulses. For example, suppose a time function f(#) 
has a discontinuity of the value # at ¢ = fp. The first derivative of f(¢) 
then contains the term + uo(¢ — tf), its second derivative contains the 
term h-1,(¢ — t), and so forth. Each discontinuity is treated in this 
manner whether it appears in the function f(f) itself or in any of its 
subsequently formed derivatives. Besides terms of this sort, the deriva- 
tives of f(é), of course, also contain terms representing the derivatives 
of the smooth portions of f(é). 
' The second restriction which is placed upon g(A) in the above discus- 
sion of the resulting contour integration, namely, that g(d) be a rational 
function, may with some reservations next be relaxed to the extent of 
allowing g(A) to be a meromorphic function. As pointed out in Art. 18 
of Ch. VI, this class of functions is more general than the rational ones 
in that the point at infinity may be an essential singularity. Thus g(d) 
is allowed to be a transcendental function, although its singularities in 
the finite \-plane must still be ordinary poles. 

This relaxation of the conditions imposed upon g(d) requires further 
consideration of the process of contour integration from two aspects. 
These are concerned, first, with the effect of the essential singularity at 
infinity and, second, with the possibility of an infinite number of poles in 
the rest of the A-plane. A few simple examples will best illustrate how 
these matters may be dealt with. 

Suppose the time function is the rectangular pulse defined by the 
relations 360. Its transform is given by 


. md 6 
x ~ f MM dt i 2 491 
gQ) = in Bg a | [491] 


Tv 


This is an entire transcendental function. Its only singularity is the one at 
infinity. Since the integrand in the contour integral 474 in this case has no 
poles at all, the entire process of evaluating this integral centers about the 
question of how the gap in the contour, which exists at the point at 
infinity, may be closed without affecting the value of this integral. 
Offhand, one may be tempted to conclude that the value of the integral is 
zero because the integrand has no poles. This conclusion is false, however, 
inasmuch as it is based upon the tacit assumption that the closure of the 
gap in the path of integration at infinity is to be dealt with in the manner 
described for rational g(\)-functions. Such an assumption is presumptive. 

The clue which leads one in the right direction is found through re- 
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wasting the function 491 in the eo. 


g(a) = —— + (ou ~ ¢-¥/2) [492] 
whence the integrand in the ee 474 becomes 
1 
on (e512) _ ghit-8/2)) [493] 


which, of course, is still an entire transcendental function. However, if 
the integrand is separated into two terms, each term is seen to have a 
simple pole at \ = 0, and in the vicinity of the point at infinity to behave 
in the same manner as already described for contour integrals involving 
rational g(A)-functions except that the variable ¢ is replaced respectively 
_by (¢ + 8/2) and (¢ — 8/2). The paths which previously were chosen for 
t <0 and > 0 are now chosen for (¢ + 6/2) < 0 and (¢4 8/2) > 0 
respectively. Except for these changes, the integral for each term has the 
form of Eq. 484 for the unit step function. Hence one obtains the result 


fi) at (: + ) ae (: = 4) [494] 


which is recognized without difficulty to meet the definitions 360, 

It should be observed that the question of how to close the gap in the 
path of integration at infinity is, in this example, resolved only after the 
integrand is separated into two terms, for the proper resolution with 
regard to one of these terms is different from that in the other. Unless 
‘the integrand is separated into two terms, it is obvious that the method of 
contour integration cannot be carried out for lack of an appropriate 
method of closing the path of integration. The method of contour integra- 
tion can be applied to other g(A)-functions having essential singularities 
at infinity only if similar artifices can be devised for dealing with this 
question. 

Regarding the possibility of encountering transcendental g(\)-functions 
having an infinite number of poles in the finite \-plane, one may consider 
the example in which is sought the current response to a unit step voltage 
at the driving point of a lossless open-circuited transmission line. Except 
for a constant multiplier, the g(A)-function in this case is of the form 


gos “ : [495] 
which has simple poles at the points 
= sd Ce [496] 


and an essential singularity at infinity. 
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The question of how to close the gap in the path of integration at 
infinity is disposed of through observing that g(A) remains bounded for 
large values of \ in the right or left half plane (which is not the case with 
the function 491 of the previous example). Hence large semicircles like 
the ones shown in Fig. 43 can be found on which the contributions to the 
contour integral for ¢ < 0 and ¢t > 0 are negligible. This question may, 
therefore, be resolved in a manner similar to that discussed for rational 
g(A)-functions. 

It remains to determine how one shall deal with the infinite number of 
poles of g(A) inasmuch as any semicircular path, no matter what its 
finite radius may be, cannot enclose all these poles. The difficulty pre- 
sented by the fact that these poles lie upon the path of integration is, 
incidentally, overcome through avoiding them by means of small detours, 
after the fashion shown in Fig. 44. This procedure is valid since, for a 
transmission line with some loss, however small, the corresponding poles 
lie in the left half of the \-plane. 

The residues of the integrand in the integral 474 with the g(d)-function 
of Eq. 495 are found to be given by 

+12 
_ sinh* dy ryt — 2 gxivati2 [497] 


Pp = x vag 


The significant point about this result is that the residues vary inversely 
as dy. Hence they become smaller and smaller for poles which lie more 
and more remote from the origin of the \-plane. The terms in the corre- 
sponding time function, therefore, become negligibly small for very 
remote poles. Since any number of terms in the desired time function 
are readily calculated, the question of choosing a sufficiently large radius 
R for the semicircular paths of Fig. 43 evidently depends upon the 
degree of approximation to which the result should be determined. The 
contour integral yields the time function in the form of an infinite series, 
which incidentally is recognized as being the Fourier series for a square 
wave. 

By making use of the statement 392, and observing that multiplying 
g(d) by 1/X” has the effect of multiplying the corresponding residues by 
1/d,”, one obtains a much more rapidly convergent series, but this effect 
is canceled by the subsequent »-fold differentiation. In practical prob- 
lems in which the form of the resulting time function is not recognized 
from inspection of its series representation so easily as it is in the present 
example, a reasonable number of terms usually suffice for a sufficiently 
good approximation provided the series converges. The question of the 
convergence of the series can always be examined by means of the expres- 
sion obtained for the residues. 

In some practical problems one may also encounter g(A)-functions 
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which are multivalued. For example, the determination of the response 
of an artificial transmission line to an applied unit step voltage leads to 
the transform 


1 
8) = + (VX? + a +A)" 


in which (the number of line sections) is an integer. The inverse trans- 
form is given by the integral 


[498] 


Px 
j== ———— ad 499 
fi) ee [499] 
The first step in the process of simplifying this integral is to let 
r 
a = Ww [500] 
which converts 499 into the form 
faye f - = [501] 
US Oj Jie Va FAV #1 + oe 
Next one introduces the change of variable indicated by 
1 1 
w= 3( = *) [502] 
whence | 
Vas AG - *) [503] 
and, from the addition or subtraction of these two equations, one finds 
z=Vw?+1+w [504] 
and 
-= Vi FI -w [505] 
Forming the differential of Eq. 504 yields 
2 
pate ee [506] 
Vw? + 1 


whence, using Eq. 504 again, one has 


fe sat ae 0 
5 aE [507] 
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By means of Eqs. 502, 504, and 507, the integral 501 assumes the greatly 
simplified form 

I ele) at/2 
flat) = nf gh dz [508] 
in which the contour C in the z-plane must be chosen to correspond to 
the contour in the w-plane which is implied in the evaluation of the inte- 
gral 501 by the method of complex integration. 

Because of the multivalued character of the integrand in Eq. 501, 
the determination of an appropriate closed contour must be considered 
with some care. First it is observed that for large values of w the trans- 
form g(w) varies as 1/w"*', and hence, even for = 0, it fulfills the 
requirements stated earlier regarding the method of closing the gap in 
the path of integration at the point at infinity. Hence one may again 
contemplate closing this gap in the manner shown in Fig. 43 fort < 0 
and ¢ > 0 without affecting the value of the integral. 

Next it is seen that for integer values of 7, the Riemann surface for 
the integrand of Eq. 501 has two leaves. The values w = +7 are branch 
points, and the portion of the imaginary axis between these is regarded 
as a branch cut. Since the point at infinity is not a branch point, the 
two leaves of the Riemann surface remain separate in this vicinity, and 
the path of integration around this point, therefore, remains on one of 
these leaves. 

The branch points, which also are simple poles of the integrand (see 
the discussion immediately following Eq. 256 in Art. 18 of Ch. VI), must 
be avoided in the manner already described by means of small semi- 
circular detours in the right half plane. In so doing, the path of integra- 
tion also remains on the same leaf of the Riemann surface (as discussed 
in greater detail below) and the condition that the contour be closed is 
fulfilled. 

Jn order to determine the contour C in the z-plane for the integral 508, 
it is necessary to consider in greater detail the substitution 502 and its 
inverse 504. For this purpose it is effective to determine the conformal 
map in the w-plane corresponding to orthogonal families of concentric 
circles and radial lines symmetrical with respect to the origin in the 
z-plane. With 


g=reit [509] 


the loci in the z-plane are defined by r = const and ¢ = const. Substitut- 
ing into Eq. 502, one has 


w =u +jo= 5 (cose +jsin 4) — 5 (cos. ~fsin 6) [510] 
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u 


1 1\.. 
v= 3(- + *) sin ¢ 
Eliminating ¢ on the one hand and r on the other, one obtains respectively 


v u? 


ll 
bw! 
o_ 

be 

| 

| 
NS 

8 
n 
& 


[511] 


=10 [512] 


and 


[513] 


Fic. 45. Conformal representation of the substitution given by Eqs. 502 and 504. 


For various values of 7, Eq. 512 represents a family of confocal ellipses, 
with foci at the points « = 0,» = + (that is, w = +7). These loci are 
shown in Fig. 45. For r = 1, the ellipse degenerates into the doubly 
traversed portion of the imaginary axis between the points w = -+4/. 
The same ellipse is evidently obtained for reciprocal values of 7, a very 
large or a very small value of 7 yielding a large ellipse with very little 
eccentricity, that is, one which is very nearly a circle concentric with the 
origin. One observes that the interior of the unit circle in the z-plane 
(r < 1) is mapped upon the entire w-plane and that the exterior of the 
unit circle in the z-plane (r > 1) is also mapped upon the entire w-plane. 
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The multivalued character of the function 504 is thus evident, inasmuch 
as two w-planes (the two leaves of a Riemann surface) are needed to 
map uniquely all points in the z-plane. The two leaves of the Riemann 
surface in the w-plane represent regions upon which the interior and 
the exterior of the unit circle in the z-plane are mapped respectively. 
The boundary between these two regions in the z-plane is the unit 
circle r = 1; in the w-plane it is the degenerate ellipse. The latter is the 
branch cut in the w-plane through which one passes from one leaf of the 
Riemann surface to the other. 

For various values of ¢, Eq. 513 represents a pair of families of confocal 
“hyperbolas, which are images of each other about the real axis. These 
loci, which are also shown in Fig. 45, are orthogonal to the ellipses 
defined by Eq. 512, and their foci also lie at the points w = +7. The 
asymptotes of any hyperbola on one of the leaves 
of the Riemann surface make angles with the 
positive real axis which are equal to the values 
of ¢ appropriate to the corresponding branches 
of that hyperbola. On the other leaf, the angle 
between an asymptote and the positive real 
axis is + — ¢. Suppose the top leaf to be that 
one on which the angle is ¢, and the lower one 
that on which the angle is + — ¢. Then if, for 
example, the branch of the hyperbola in the top 
leaf for ¢ = 30° (this is in the first quadrant) 
is traversed in the direction toward the branch 
Fic. 46. Path in the cyt, one finds, after following this hyperbola 
eee eee through the branch cut (into the second quad- 
according to the substitu. ant) that one is now on the lower leaf but still _ 
tion shown in Fig. 45. on an hyperbola to which ¢ = 30° is appropri- 

ate. If, instead of passing through the branch 
cut, one imagines jumping over it so as to remain on the top leaf, one 
finds oneself on an hyperbola (in the second quadrant) to which 
¢@ = 180° — 30° = 150° is appropriate. 

With these properties of the substitution 502 in mind, it is easily 
appreciated that the path in the z-plane corresponding to the imaginary 
axis in the w-plane, traversed from —j to +j (avoiding the points 
w = +7 by remaining slightly to the right of them), has the form shown 
in Fig. 46. The process of closing the path in the w-plane by means of a 
large semicircle in the manner shown in Fig. 43 corresponds to closing the 
path of Fig. 46 in the z-plane in the same manner.* The large semi- 


*It is true, of course, that a large circle concentric with the origin in the w-plane is not 
exactly (although very nearly) also a circle in the z-plane. This fact is, however, unimportant 
inasmuch as the argument regarding the closure of the path of integration does not require 
that the path increment in question be circular in form. 
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circles for ¢ <0 and é > 0 also lie in the right and left half planes respec- 
tively. The contour C in the integral 508 is thus determined. 

For ¢ < 0 the value of this integral is evidently zero, for the integrand 
has singularities only at the points ze = 0 and zg = ©. For? > 0 the con- 
tour C encloses the one singularity at z = 0. This contour may, therefore, 
be given any other form, as long as it encloses the origin in the z-plane. 
Choosing the unit circle for the path C and writing in conformance with 
Eq. 509, 


z= ei? [514] 
one has 
d: j , 
- = jd [515] 
and 
1 1 7 
i(:-2) =jsno [516] 
so that the integral 508 becomes 
f(at) es ~f eilatsind~—nd) do [517] 


This result is seen to be identical in form with the integral representation 
for the Bessel function, as given by Eq. 307. Hence one has for ¢ > 0, 


f(at) = J, (at) [518] 


It may be of interest to observe that the integral 508 alternatively 
represents the coefficients 6, in a Laurent expansion of the function 
e@1/4et/2 about its essential singularity at z = 0. One recognizes this 
fact from a comparison of Eq. 166 of Ch. VI with the integral 508, 
remembering that C is a contour enclosing the origin in the z-plane. As 
discussed in Art. 7, the substitution 514 converts the Laurent expansion 
into a complex Fourier series. One thus obtains the Fourier series rep- 
resentation for the function 303 dealt with in Art. 16. 

An additional interesting feature about the transformation from the 
integral 501 to the integral 508 by means of the substitution 502 deserves 
special mention. In examining the integrand in the integral 501 one 
observes (as pointed out above) that the points w = +/ not only are 
branch points but also are simple poles, since the factor V we + 1 be- 
comes zero there. As the corresponding points in the z-plane, which from 
Eq. 504 are seen to be z = -j, the integrand in the equivalent integral 
508, however, does not possess singularities. The reason for this peculi- 
arity lies in the fact that the function w(z) represented by the substitu- - 
tion 502 possesses saddle points (see Arts. 14 and 18, Ch. VI) atz = +). 
This fact becomes evident upon the forming of the derivative of Eq. 502, 
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dw i 1 e 
ee (1 + :) [519] 
This derivative has simple zeros for z = -47. Since the second derivative 
does not vanish at these points, they are saddle points of the first order. 
The vanishing of dw/dz at the points z = +/, corresponding to w = +7, 
may also be seen from Eq. 507. Since the quantity dz/z = dw/Vw* + 1 
remains regular in these points, the integral 508 does so likewise. For this 
reason it is unnecessary that the contour C of Fig. 46 be modified so as to 
avoid the points z = -+47; the path of integration in the z-plane may pass 
through these saddle points. 
It may also be of interest to recognize that the integral 508, except 
for the factor 4, is equivalent to Sommerfeld’s integral 306. In place of 
the relation 514, one uses the substitution 


z= ei [520] 


in which 7 is regarded as a new complex variable. Writing p for at and 
dropping the factor }4, one has the integral 


Five ! f gipsinr—wr) gy [521] 


This is simply an alternative form for Sommerfeld’s integral as given by 

Eq. 306. The latter form is obtained from Eq. 521 by making the addi- 

tional change of variable , 
feta. [522] 

2 . 

The minus sign resulting from the fact that df = —dr is unimportant 

inasmuch as it may evidently be canceled through traversing in opposite 

direction the path of integration, which as yet is not specified. 

For the various kinds of cylinder functions* defined by the integral 
521, the path of integration L begins and ends at infinity. In order to 
insure the convergence of the integral, it is necessary (assuming p > 0) 
that the portions of Z which extend toward infinity do so within regions 
of the r-plane in which the real part of 7 sin 7 remains negative. Letting 
r= ot Jn, one has . 

jsin (6 +jn) = — cos¢sinh » +7 sin ¢ cosh y [523] 


*For the demonstration showing that the function 521 formally satisfies Bessel’s equation 
(which is dissociated from the present discussion), the reader is referred to the literature on 
this subject, for example, R. Courant and D. Hilbert, Methoden der mathematischen Physik, 
I (Julius Springer, 1924), 382, or E. T. Copson, Theory of Functions of a Complex Variable 
(Oxford, 1935), 313. 
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whence it is readily recognized that the regions in which the real part is 
negative are those shown cross-hatched in Fig. 47. Paths such as those 
labeled Z, and ZL. may, therefore, be regarded as closing upon them- 
selves at infinity, so that the principles of contour integration become — 
applicable. 

- The detailed form of such a path within a cross-hatched region may, 
therefore, be modified at will without affecting the value of the integral. 
Thus the path Z, may alternatively be assumed to lie along the imagi- 
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Fic. 47. Appropriate paths of integration Fic, 48. Paths in the z-plane 

passing through saddle points, used in the corresponding to modified ver- 

approximate evaluation of Sommerfeld’s sions of Z, and Z, in the 7-plane 
integral. of Fig. 47. 


nary axis from 7 = © to 7 = 0, thence to lie along the real axis from 


¢ = 0 to ¢ = a, and from there to proceed toward y = — along the 
vertical line ¢ = x. Similarly, the path Z, may be assumed to lie along 
the line ¢ = —7 from 7 = ~—& to 7 = 0, thence to lie along the real 


axis from ¢ = ~x to ¢ = 0, and finally to proceed toward 7 = © along 
the imaginary axis. 

The paths C, and C, in the z-plane, corresponding respectively to these 
modified versions of the paths L, and L, in the 7-plane, are shown in 
Fig. 48 (with due allowance for slight departures necessitated by drawing 
both paths in the same figure). 

The specific functions defined by the integral 521 for the paths LZ, and 
L, are referred to as the Hankel functions or also as Bessel functions of 
the third kind. These are 


H, (0) = “ i gener de | [524] 
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and 
A, (p) ae ee dr [525] 
RIL, 


For real as well as complex values of p, these functions are complex. 
The conjugate value of H,,” (p) for real values of p is given by 


H,(p) == f entosinr—m) dy [526] 


in which every point on the path Z is the conjugate of a corresponding 
point on Ly, that is, LZ, is the image of L, with respect to the real axis. 
If +, the variable of integration, is replaced by —7, every point on the 
path of integration is replaced by its negative; that is, the path of integra- 
tion becomes replaced by its image about the origin. If the latter path is 
denoted by —JL,, one may write in place of Eq. 526 


H,, (p) = =f gio sin7—nr) d(—7) [527] 


It is now observed that the path —Z, is the image of L, about the 
imaginary axis. Reference to Fig. 47, therefore, shows that —Z, is 
identical with the path Lz except for a reversal of the direction in which 
it is traversed. Changing this direction merely reverses the algebraic 
sign of the result. Hence one has 


H,,© (p) = = [ea dr = HH, (p) [528] 


that is, for real values of p, the Hankel functions of the first and second 
kind are conjugate complex. 

The sum of these two functions may be expressed by a single integral 
of the form 521 in which the path Z is the resultant of the paths L; and Lz. 
Reference to Fig. 48 shows that the corresponding resultant path in the 
z-plane is the unit circle enclosing the origin. According to the preceding 
discussion this path yields the Bessel function J;,(p), hence 


In(p) = 4{Hn™ (0) + Hn (p)} [529] 


For real values of p, one may regard the Bessel function as the real part 
of either of the Hankel functions. The relation 529, however, is by defini- 
tion assumed to hold for complex as well as for real values of p. 

For the sake of completing the present picture, it may be mentioned 
that the so-called cylinder functions of the second kind (also called 
Neumann functions) are given in terms of the Hankel functions by a rela- 
tion complementary to Eq. 529, namely, 


Malo) = 5; [Hy (p) — Hy (o)} [530] 
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One observes that the Hankel functions are analogous to the exponential 
functions e” and e~*, the Bessel and Neumann functions to cos x and 
sin x respectively. 

Asymptotic expressions valid for large values of the argument p may 
be obtained through evaluating the integrals 524 and 525 by the so-called 
“ saddle-point ” method. In terms of the variable 7 defined by the sub- 
_ stitution 520, the function 502 reads 


w = jsin r = (dyn) +Jo(en) [531] 


The saddle points which, in the z-plane, occur for 2 = +, are located 
in the z-plane at the points r = --2/2. Reference to Fig. 47 shows that 
the paths Z; and Lz pass through these points. There the value of u is 
zero, whereas on either side of a saddle point u is negative. The exponen- 
tial function 


eresinr _. er(utir) $ [532] 
appearing in the integrals 524 and 525, has the magnitude 
jeer | see et [533] 


the value unity obtaining at a saddle point. 

If one is mindful of the general character of the contours in the 7-plane 
defined by « = constant and v = constant, as discussed in Art. 14, 
Ch. VI (in particular, see Fig. 10 of Ch. VI for s = 2), one observes 
that if the path of integration is chosen to coincide with that contour 
» = constant which passes through the saddle point, the function ~, and 
hence the exponential function given by Eq. 533, experience their most 
rapid rate of growth and subsequent decay. This fact is readily appre- 
ciated if one utilizes the analogy of regarding the loci « = constant as 
being contour lines in a mountainous terrain, and the orthogonal loci 
» = constant as indicating the direction of the gradient (direction of 
steepest ascent) in this terrain. The saddle point has the character of 
a mountain “ pass,” and the contour v = constant which passes through 
the saddle point represents the shortest route along which one may scale 
the height of the “ pass ” and descend into the valley beyond it. Clearly 
then, if this route is chosen as the path of integration, the values of the 
exponential function 533 pass continuously and most rapidly from the 
negligibly small magnitudes which obtain for points within the shaded 
regions of Fig. 47 remote from the origin, through their maximum (which 
occurs at the intersection of the path with the real axis), and back to 
negligibly small magnitudes again. 

It is also readily appreciated that the portion of the path (in the 
vicinity of a saddle point) throughout which the exponential function 
has appreciable values becomes shorter as p becomes larger. For large 
values of p, therefore, the principal contribution to the value of either 
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of the integrals 524 or 525 is furnished by a rather short path increment 
in the immediate vicinity of the respective saddle point. An integration. 
over such a short path increment alone then yields substantially the 
correct value of the desired function, the approximation becoming 
asymptotically better and better the larger the value of p. 

In order to be able to carry out such an integration, one must first 
determine the direction (in the 7r-plane) of the contour v = constant 
passing through the saddle point. From Eqs. 523 and 531 one has 


u = —cos¢sinh 7 
and [534] 
» = sin @ cosh 7 


At the saddle points r = +2/2 (that is, ¢ = +2/2, 4 = 0), so that, as 
pointed out above, uw = 0 and 


D=U = [ sin ¢ cosh of =s/2 = +1 [535] 
: 7 = 
Along a contour v = constant one has 
dv = 0 = cos ¢ cosh 7 d¢ + sin ¢ sinh n dy [536] 
from which 
cau = —tan¢tanhy [537] 
ay 


By use of Eq. 535 the values of this derivative at the saddle points are 
found to be 


ere eae [538] 
dy 


in which the minus sign applies to the saddle point at r = +2/2 and 
the plus sign to the one at +r = —1/2. Hence the path Z1, which passes 
through the saddle point at r = 2/2, should do so at an angle of —45 
degrees with respect to the real axis, whereas the path L, should pass 
through its saddle point at + = —7/2 at an angle of +45 degrees. The 
paths shown. in Fig. 47 comply with these conditions. 

Next it is necessary to determine the detailed behavior of the function 
u(o,n) in the immediate vicinity of the saddle points. This behavior 
must be expressed in terms of a variable which represents length measured 
along the respective paths of integration through these points. If this 
variable is denoted by s, the above determination of the path increments 
through the saddle points shows that one may write for the immediate 
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vicinities of these points 


r= + 3 + serine [539] 


whence 
w=jsin r = +7 cos (se¥?*!*) [540] 


Here s is regarded as a small quantity, so that one has approximately 
2 2 2 
wy & +i(1 = = vinit) = +j(1 +j5) = — 7 +7 [S41] 
and, therefore, 


“= —— [542] 
e 
The value of v, according to the result 541, of course, agrees with that 
expressed by Eq. 535. The exponential factor 532 finally becomes 


esine my etie. g-el2, = for ~—[s| <1 [543] 


in which the plus sign in the exponent is to be used in the integral 524 
and the minus sign in the integral 525. 

The exponential factor e¢~’"", which also appears in these integrals, 
may, for the short path increment over which the integration is ex- 
tended, be regarded as slowly variable compared with the factor 543, and 
hence as replaceable by its values at the saddle points. These are 
erina/2 : 

‘Observing, according to Eq. 539, that 

dr = eFitl4 ds [544] 


one then has for the desired asymptotic expressions for the integrals 
524 and 525 


Hy (p) & I ei(p—na/2~n/4) 5 ie EP 8/2 ds [545] 
© 


and ; 
1_s. ie 
Hep) mer eHearieei fMerreitds 546] 
v 


in which 2 is the length of the small path increment over which the 
integration extends. If the integrand e~°*/? is plotted versus s for large 
values of p, one finds that the area. under this curve is confined sub- 
stantially to the immediate vicinity of the origin (s = 0). In other words, 
the total area under this curve (which is obtained through integrating 
from s = — © tos = o) differs from the area within a small region in 
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the vicinity of the origin by an amount which becomes smaller and smaller 
as p becomes larger and larger. The total area is given by 


f * grt ds = [2% [547] 
aA \> 


Hence one has for the desired asymptotic expressions for the Hankel 
functions 


Hy,” (p) & 2. . eile—(2n-H1) 4/4) [548] 


and 


HH,” (p) ~ 2 - ¢dle—an Hh) 1/4] [549] 
c) ° ee 
According to Eq. 529, a corresponding asymptotic form, valid for 
large p, is obtained for the Bessel function 


In(o) =~ Ree (> - a : r) [550] 


For » = 0 and n = 1 this result yields the formulas given by Eqs. 322 
and 323 in Art. 16. 

It may be well to point out that the approximate expressions just 
derived do not yield accurate results if the parameter ” as well as the 
argument p is large, that is, if 7 and » are of the same order of magnitude. 
The truth of this statement is readily seen from the fact that if ” is also 
large, the factor e?”", appearing in the integrals 524 and 525, may no 
longer be regarded as essentially constant throughout the path increment 
over which the integration extends. 

For a more general treatment of the present problem, one begins by 
setting 


p= an [551] 
and rewriting the integrals 524 and 525 in the forms 
1 f a 
ce) aa jn(a sin. 7—1) 
H,“ (an) =~ J é dr [552] 
and 
1 ee 
(2) a jn(a sin r—r) 
H,, (an) “fe dr [553] 


In considering the saddle-point method of integration one then lets 
w=j(asine —7) =ut+ju [554] 
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The saddle points are those values of 7 for which 


Oa ai eo [555] 
dr 
yielding 
1 
cos tT = = [556] 


For large values of a (that is, for p >), Eq. 556 yields very nearly 
cos7=0 or r=a9 [557] 


which are the saddle points of the function 7 sin + considered above. 

The complete treatment of this problem (a being assumed real) re- 
quires separate consideration of the cases @ > 1, a = 1, anda < 1. One 
obtains, in this manner, representations for the cylinder functions in the 
form of semiconvergent series of which the approximate results derived 
above are the first terms. The further detailed discussion becomes too 
specialized to be included under the heading of the present article,* in 
which the primary objective is to consider the essential principles in- 
volved in the. use of complex integration for the evaluation of inverse 
Fourier transforms. 


PROBLEMS 


1. Make sketches of periodic functions which have the following specific char- 
acteristics: 
(a) The Fourier series contains only sine terms but all harmonics are present. 
(b) The Fourier series contains only sine terms and only odd harmonics. 
(c) The Fourier series contains only cosine terms but all harmonics are present. 
(d) The Fourier series contains only cosine terms and only odd harmonics. 
(e) The Fourier series contains sines and cosines but only odd harmonics. 
(f) The Fourier series has the property that the odd harmonics are sines and the 
even ones are cosines. 
(g) The Fourier series has the property that the odd harmonics are cosines and the 
even ones are sines. 
(h) The Fourier series has only even harmonics. 
(i) The Fourier series has harmonics of order a, 2a, 3a, +++, a being any fixed 
integer. 
2. Given a set of functions ;(¢) for k = 1, 2,+++, with the following properties, 
¢1(¢) is periodic, having the period r. 


bot) =oit—t), alt) =dill — ta), +++ Galt) = bit — tn) 


in which #, ta, -++t,~-1 are any finite quantities. Let the sum of these functions be 
denoted by $(¢), ie., 


nm 


o@) = XL gilt) 


k=1 


*This material. may be found, for example, in Courant-Hilbert, of. cit., pp. 436-440, or 
E. T. Copson, of. cit. pp. 330-336. 
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If the function $1(¢) has a Fourier series representation with the cosine and sine 
coefficients ay and by, respectively, show that the corresponding coefficients for the 
resultant periodic function $(t) are given by the expressions, 


Ao = nao 


n n 
Ay = dy >> COS Yul, — by >> sin vot, 
k=1 k=1 


n nr 
B, =b, ~ Cos put, + ay Pp» sin vost, 


Hint. Write the Fourier series for ¢,(¢) and $(t) in exponential form first and then 
convert to the trigonometric forms. 

3. Suppose, for the set of functions defined in the preceding problem, one chooses 
t, = k(1/n). The periodic functions ¢1, 2, --+¢, then form a cyclic group. Show 
that their sum ¢(¢), when it neither vanishes nor reduces to a constant, represents a - 
periodic function with the period 7/n. Show that the same is true of the function 
F(t) = 1 X $2 X +++ Kon given by the product of the functions forming the 
cyclic group. 

4. For the sum function $(¢) of the previous problem show that its Fourier coeffi- 
cients A, and B, are given in terms of a, and 6, for the component functions by the 
simple relationships: A, = nay, B, = nb,, for p = 0,1,2,--- andy = nu. Correlate 
this result with the formulas given in Prob. 2. 

5. As an application of the principles illustrated by the previous problems, let 


$x(f) = sin ot for 0 <i<= 


; Tv ar 

o(t) = 0 for <i< = 
and consider the cyclic groups for » = 2, 3, 6, 12, representing wave forms resulting 
from polyphase rectification. Compute the coefficients A, and B, according to the 
formulas of the previous problem and check, through carrying out the usual integra- 
tion. 

6. Square pulses of amplitude A and duration 6 = 7/10 characterize the periodic 
function 


T 
gilt) = A for O<t< 


r 
gilt) =0 for 75 <t<r 


Find the spectrum of this function and compare it with those obtained from a cyclic 
group form = 2 and m = 3 according to the principles given in the previous problems. 

7. In applying the formulas in Art. 15 to the numerical evaluation of Fourier 
coefficients for graphically given functions, a simplifying expedient is to alter the 
number of intervals according to the order of the Fourier coefficient being calculated 
instead of using the same fixed number of intervals for the calculation of all coefficients. 
For example, if, in the use of formulas 299 to 302, one chooses n = r, all the ay, and 
Bir become -41 or zero, and the resulting computations are correspondingly simplified. 
This procedure, known as the Fischer-Hinnen method, must evidently be applied 
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with care. The results are not good for the fundamental and lower harmonics but 
improve with increasing order. 

1 ee © 
78'9'6' "3 
respectively 0.0, 4.0, 7.0, 8.9, 9.7, 10, 9.5, 9.0, 8.7, 8.65, assuming that f(—x) = —f(x) 
and f(x +7) = —f(x). Compute harmonics through the seventh. Compare with 
results obtained from computations that do not utilize this simplified approach. Plot 
both results and compare with a plot of the given function. 

8. For increments in. x equal to 7/12, starting with x = 0, the values of.a given 
. function f(x) are: 


Try this method out on a function f(x) whose values for x = 0 are 


0, 4.2, 8.9, 9.9, 9.5, 8.9, 8.5, 8.9, 9.5, 9.9, 8.9, 4.2, 0, 
1.5, 1.8, 1.9, 2.1, 2.5, 3.0, 2.5, 2.1, 1.9, 1.8, 1.5, 0 


Choosing ” = 12, use the formulas 299 to 302 to compute the harmonics through 
the 11th. Plot the resulting partial sum and compare with the given function. 


9, Given the function f(x) = f(x + 2k) defined by 


x 
- forO<x<a 
wx 

f(x) = ae fora <*% <2" ~«@ 
ar — x 


for 21 ~a <x <2nr 


Show that for this function a, = 0 and 


2 sin na 
a(x — a)n* 


bn = 


From this result determine the Fourier series for the following functions having the 
same period: 


joy t=t for 0 < « < 2m 
22 for O<2<% 

T 2 

2 — x) € 3a 

f(x) = = for 7 << 2 
208 >) ae ee eee oe 

Te 2 
= for 0<x<7 
f(x) = 

== for 7 <2 <2n 


Make sketches of all the functions. 
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10. Find the Fourier expansion for a periodic function defined by 
0 for0 <t <a; (a <2) 
f® = sin t= fora<i<r—a 
0 forr -a<i<7r 


Make a plot of the function and find the form of the Fourier series corresponding to 
a = Q. Find the sum of the series for = aandt = 7 — a. 


11. A function f(f) is defined by 


0 forO<t<i—a 
aT T T 
f® = cos = (F -1) for; ~a<t<pt+a 


0 for +ba<t<r 


Determine the Fourier series expansion. Plot the function for @ = 7/5 as well as the 
partial sums 51, 52, -+° 5s, thus showing the manner in which the given function is 
approximated. Plot the spectrum function for the same value of a. 

12. A periodic function consists of a regular succession of identical pulses of short 
duration (similar to the function of Prob. 11 for a/r <1) the area under each pulse 
being A. Show that the values of the constant term and those of the fundamental and 
lower harmonic amplitudes are very nearly independent of the detailed pulse shape 
(whether rectangular, triangular, sinusoidal, etc.), being proportional only to the 
pulse area. Deduce the pertinent relationships. As the duration of the pulse is assumed 
to become shorter and shorter, the area remaining the same (=A), show that the 
Fourier coefficients are ultimately given by ao = A/t, @, = 24/7, independent of n. 

13. Check the expansions of the following functions for the interval 0 < x $7. 
Plot the functions and several terms of the series, noting rapidity of convergence. 

v — sin nx 
gun =) ae n3 


COs nX 
n=13,5,--- 14 


a ame 2 — 2x2) = 
og * 2x) (x? + 2x — 2x?) 


14. Consider f(x) = (44) (4 — x) sin x within the interval 0 < x <7, and check the 
following series representation: ; 


fe iiadat a Nes 2x — d — cos 3x — : = cos 44 — 
2°74 ise 24 3-5 


15. By using the Cauchy principle of convergence, show that the series 


o os 
Sa x sin 2X 
n=1 MN 


converges uniformly except at the points x = 0, 27, 4z,- 
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16. Determine the regions of uniform convergence of the series 


oe 
COS NX 
s= yom 
n=0 nN 


and define the points at which it diverges. 
17, Show that the series 


a 
Le*cosns and Dz"sinnsz 
0 


converge absolutely and uniformly inside a circle of unit radius. 
18. Show that the series 


oO 
Daw" cosnz and D> ags"sin nz 
0 


converge absolutely and uniformly inside a circle of radius 


R = limit |a,|-1/" 
no 


. 


19. Discuss the convergence of the series 


in which @y > Gn41 > Gn42°-+ and a, > 0 for n > &. Does the point x = 0 belong 
to the region of uniform convergence? 
20. Show that the series 


: sin 2x sin 3x sin "x 
sin x + a ees 
2. + 3 
converges uniformly except at points x = 0, +27, +47,---. 
21. Show thatrthe series 


C4 


COS NX 


S = 2 
n=13,5,-°° 


converges uniformly to the function (7 — 2x)m/8 in the interval 0 < x <7 and to 
the function (x — 3/2)m/4 in the interval r S x < 27. 
22. Show that the expansion 


fa? 1 1 
(5 ~ *) = COS x ~ 73 008 2x + 33 00s 3x ~ 7y008 de + —-:- 


converges uniformly in the interval —x < x <7. Through an obvious change of 
variable obtain the series 


2 2 
gig ee Ee $e] 
3. ox? c ¢ c 
and state its interval of uniform convergence. 
23. Using the results of Prob. 22, show that 
wr? 1 1 1 
2 Rts He 
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and 

a? 1 i of 

ae 1+ 32 + 32 + 42 + 
24. Using the results of Prob. 22, show that the series 


c. 4¢ we 1 3rx Sax 
de eae Me ge + gs bos ~~ ore 
converges uniformly within the interval 0 Se c, and obtain the result 


wr? 1 1 1 
rs el+gtatat: 
25. Through the use of a Laurent expansion obtain the series 


1—rcosz 


1 eae =1+9rcosz +7? cos 22 +r? cos 3g ++°* 


in which ¢ is real but z may be complex. Show that the region of convergence is defined 
as -it <r <i. . 


26. Find the coritinuous spectrum for the function 
0 fort <0 
fth=) 
Ae sin wot for0 St < © 


Plot the result for A = 10, wy = 2m X 10°, = 2 x 104 
27. Find the Fourier transforms of the following functions: 


(a) {f@) =O for? <0 
f®) =a ford <i<t 
f@ =a -b¢-h) forti<t<te =n(1 +4) 
f® =0 foriz<i<o 
(b) (f@) =0 fort<4>0 
IQ =e) forts <t <te 
f® =0 foriz<i< © 
28. Let G(w) = 2rg(w) and introduce F(t) =eo* X f(t) in which o is a real quantity. 
Then letting s = o + jw, obtain from the Fourier transforms the Laplace transforms 


1 re +7 00 
Fi) = mah ; G(s)e* ds 
o—j 2 


G(s) = f 4 Fite dt 


Show that the condition for the existence of the transform G(s) may be expressed by 
stating that the integral 


f - je~*tF(@)| dt remains bounded 
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Thus show that if |F(¢)| < Ce“, in which C and c are positive real constants, the 
transform G(s) exists so long aso > c. (The quantity c is, therefore, called the abscisss 
of uniform convergence.) 


29. Find the abscissa of uniform convergence for the following functions 


; 1 1 1 : 1 1 1 1 
Sepp taal: «wale. See a og ee coe 
5?’ 3’ sta’ s—a Gane = 2)" (s +a)?’ (s—a)?’ 
s* i e 


(s +a)(s —a)?’ Wii V/s+avs—-b° s—a 


30. With the interpretation given in Prob. 28, and assuming F(t) = 0 for t < 0, 
show that the transforms of the following functions: 


e, ee, sinat, cosat, sinhat, coshat, ¢* sinh bi, 
é* cosh bt, 7, #, tsinat, tcosat (in which ae and b 
are positive real constants) 

‘ do exist; and by direct integration obtain for G(s) respectively: 


a 1 a s a s b 
GS Sue Sa! Sa! Bae sae weather: 
Ss s-—-a@ s*+a st+ta s?-—a sta (s +a)? —b 
S+a 1 n! 2as s? ~ @? 


(sa)?—B?’ 32? sett? (G2 G2)?’ G2 4 ad)? 


31, Through contour integration find F(t) corresponding to 


1 
SO) = Cae -ne-o 


in which a, b, c are real (unequal) quantities. Without further direct integration, 

what are the time functions corresponding to the following transforms: 
gg ae ns Pea nce A nes aes 
(s—a)(s—b)(s—c) (s—ay(s— d)(s—c) s(s —a)(s — b)(s — c) 


32. Starting with the pair of transforms 
Fé) =e G(s) = 


~@ 


find through convolution the time functions corresponding to 


1 1 1 1 
(—ay?’ (s—a)s—b) (S—aj(s+b) (sta)(st+b(s+c) 


et 
(s +.4)(s — b)(s — ¢) 
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Abel, 285 
Abridged quadratic form, 165 
Absolute convergence, 277 
circle, 283 
Adjoint, 42, 47 
Admittance function, 359 
Affine co-ordinates, 85 
scale of length in, 88 
transformation, 85 
Algebra, fundamental law of, 325 
Algebraic functions, 314, 318, 321 
singularities, 318 
Alternating component, of a periodic func- 
tion, 471 
Alternatives, rule of, 105 
Altitude function, 183, 195, 351 
Amplitude spectrum, 473 
Analytic continuation, 288, 549 
functions, 257 
Approximation, Bessel functions, 509, 568 
Fejér polynomials, 496 
Fourier series, 483, 485 
Gibbs phenomenon, 495 
orthogonal functions, 442, 459, 501 
‘uniform tolerance for, 502 
Arc, differential, 198, 235 
Arithmetic mean sequence, 285 
Augmented matrix, 64 
Axes, principal, 137, 156, 159 
semi-, 140, 144 . 
co-ordinate (see Co-ordinates) 
Axial vector, 183 


B 


Beating, 513 
Bessel equation, 440, 562 
functions, 440, 506, 561 
approximation, 509, 568 
relation to Fourier series, 506 
Bierens de Haan, 345 
Bilinear form, 134 
transformation, 363 
Bipolar circle, 372 
Birkhoff, G., 142 
Bocher, M., 156 
Bode, H. W., 343, 346 


Bolzano-Weierstrass theorem, 276 
Bordered determinant, 11 
Borel, 285 
Boundary conditions, 207, 379, 440 
distributions, 441 
-value problems, 207, 379, 387 
Branch cut, 315 
integration across, 316, 560 
point, 296, 314 
definition, 314 
integration around, 310, 316, 560 
interpretation as a vortex, 351 
logarithmic, 319 


Cc 


Canonic form, matrix, 61 
quadratic form, 150, 154, 156, 159, 172 
Carathéodory, C., 1 
Cartesian co-ordinates, 85, 100, 137, 159, 184, 
188, 235 
Casorati-Weierstrass, theorem of, 296 
Cauchy integral formula, 272 
integral law, 267 
principal) value of an integral, 339 
principle of convergence, 277, 466 
residue theorem, 302 
-Riemann equations, 256 
’ theorem on the number of zeros in a region, 
407 
Cayley-Hamilton theorem, 118 
Central conics, 135 
Cesaro sum of a series, 285, 496 
Chain, dipole, 358 
Characteristic determinant, 111, 141, 162 
equation of a matrix, 62, 111 
function of a matrix, 111 
matrix, 111, 141 
values of a matrix (see Latent roots) 
Charge density, 204, 291, 349 
Checkerboard rule, in determinants, 7 
Christoffel, 378 
Circle, bipolar, 372 
convergence, 283, 288 
transformation of, by fractional transfor- 
mation, 366 4 
unit, 284, 361, 369, 417, 419 
Cluster point, 276 
Cofactor, definition, 6 
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Cofactor, minor, 6 
relation to direction cosines, 112, 162 
use in inverting a matrix, 41 
use in solving equations, 13 
Collineation, 117 (see also Linear transfor- 
mation) 
Collineatory transformation, 117, 137 
effect on latent roots, 118 
Column matrix, 32 
Comparison test, for series, 279 
Complex integration (see Integration) 
plane, 253, 258, 262 
simple, 365 
sphere, 262, 317, 376 
variables, 253 
conjugate value, 257 
definition, 253 
functions of (see Functions of a complex 
variable) 
polar representation, 253 
Condensation, point of, 276 
Condenser, 387 
Conditional convergence, 281 
Conditioned maxima, method of, 143, 211 
Conformal mapping, 258, 326, 330, 360, 378 
algebraic function, 315, 318 
at a zero or saddle point, 300 
circle, 363 
complex sphere, 263 
general function, 380 
inverse function, 326 
inversion, 331, 361 
linear fractional function, 363 
logarithm function, 310, 325 
polygon, 383 
polynomial functions, 397 
reciprocal function, 360 
reflection, 361 
Schwarz-Christoffel transformation, 378 
use of exponential function, 379, 391 
Congruent transformation, 136, 149, 157 
Conics, 135 
Conjugate diameters of an ellipse, 160 
matrix, 43 
potential functions, 333, 351 
Constraint, plane, 165 
vector, 166 
Constraints, linear, on a quadratic form, 165, 
172 
Continuant, 21 
Continuation, analytic, 288, 549 
Continued fraction expansion, 403 


Contour integration, 199, 204, 223, 264, 268, 


548 
functions of a complex variable (see 
Integration) 
functions of a real variable, evaluation, 
432, 548 
lines, 196 
map, 196 


Contragredient sets of variables, 95 
Contravariant components of a vector, 95, 
159 
Convergence, series (see Series, convergence) 
uniform, 282, 465, 500 
abscissa of, 575 
Convolution, 530 
Co-ordinates, affine, 85 
scale of length in, 88 
Cartesian, 85, 100, 137, 159, 184, 188, 235 
cylindrical, 234, 238 
general, 28, 234, 439 
normal, 111 ; 
orthogonal, 81, 85, 97, 13/7, 140, 165, 184, 
234, (see also Co-ordinates, Car- 
tesian) 
reciprocal, 90 
rectangular (see Co-ordinates, Cartesian) 
right- and left-hand, 84, 184 
spherical polar, 234, 239 
transformation of, effect on latent roots, 
118 
effect on quadratic form, 135, 165 
oblique systems, 85, 97 
orthogonal systems, 81, 166, 184, 234 
Copson, E. T., 562, 569 
Couple, 543 
Courant, R., 142, 562, 569 
Covariant components of a vector, 95 
Cramer’s rule, 15 
Critical points, 382 
Cross product, 190 
Crout, P. D., 64 
Curl, in curvilinear co-ordinates, 243 
definition, 208 . 
determinantal form, 213 
divergence of, 218 
of the gradient, 213 
interpretation as a linear vector trans- 
formation, 228 
surface, 217 
of the vector r, 234 
Curvilinear co-ordinates, 89, 234 
curl in, 243 
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Curvilinear co-ordinates, divergence in, 242 
gradient in, 240 

Cylinder functions, 440, 507, 562 

Cylindrical co-ordinates, 234, 238 


D 


d’Alembert ratio test, 278 
Dedekind test for convergence, 282 
Definiteness of quadratic forms, 144, 149, 150, 
164, 171 
Degeneracy (see also Nullity, Rank) 
matrix, 61, 105, 109, 137 
degree of, 61 
quadratic form, 137 
Del, 197 
Dependence, linear, algebraic equations, 18, 
100 
vector fields, 186 
vector set, 77, 100, 114, 143 
Derivative, directional, 196 
of a function of a complex variable (see 
Functions of a complex variable, 
derivative) 
normal, 196 
of the singularity functions, 542, 546 
Determinant, bordered, 11 
characteristic, 111, 141, 162 
checkerboard rule, 7 
cofactor, 6, 40, 112, 162 
Cramer’s rule, 15 
definition, 1 
development of, 7 
evaluation, 3, 9 
fundamental properties, 1 
Gramian, 146, 151 
Laplace’s development, 7 
of a matrix, 31, 40, 137 
minor, 4, 19, 56 
principal, 5, 154, 406 
multiplication, 11 
_ order, 1. 
product, 11 
rank, 18, 39, 61, 78 
symmetrical, 15 
triangular form, 7 
use, in solving linear equations, 13 
Wronskian, 25 
Determinantal form, curl, 213 
Hurwitz criterion for polynomials, 405 
scalar triple product, 194 
vector product, 192 
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Detour, semicircular, in contour integration, 
382, 549, 552, 560 
Diagonal form, reduction of a matrix to, 59, 
111, 137, 146, 152 
relation to quadratic form, 137, 143 
Dickson, L. E., 119 
Differential arc, 198, 235 
equations, Bessel’s, 440 
Cauchy-Riemann, 256 
Laplace’s, 207, 243, 266, 378, 439 
Legendre’s, 440 
Poisson’s, 207 
solution, 441 
wave, 440 
rectangle, 212, 269 
vector, 198, 201, 235 
volume, 202, 209, 241 
Differentiation, functions of a complex vari- 
able, 254, 275 
of a power series, 284 
of a vector, 225, 232 
Dipole, 216, 353 
chain, 353 
moment, 353 
Direction cosines, 82, 101, 112, 162 
parameters, 126 
Directional derivative, 196 
Dirichlet conditions, 463, 511, 520, 527 
Discriminant, 132, 137, 146, 154 
Distribution, boundary, 441 
initial, 441 
source, 200, 206, 291 
Divergence, 200, 228, 264. 
of the curl, 218 
in curvilinear co-ordinates, 242 
definition, 201 
of the gradient, 206, 220 
surface, 206 
of the vector r, 233 
Dominant series, 279 
Dot product, 188 
Double stratum, 357 
Doublet, 353 
unit, 542, 546 


E 


Effective value, 478 
Fourier transform of, 530 
Eigenwerte, 140 (see also Latent roots) 
Electric charge density, 204, 291, 349 
field, 184, 349, 387 
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Electric charge density, field, intensity, 349 
networks, 158, 346, 359, 410, 416, 477, 481 
transmission line, 441, 452, 555 

Elementary functions, differentiability, 257 
transformation matrix, 54, 146 

Ellipsoidal surface, 138, 144, 149, 156, 165 

Elliptic transformation, 368 

Energy functions, 156, 184 
potential, 200, 222 

Entire functions, 297, 321 

Envelope functions, 515 

Equations, characteristic, of matrices, 62, 111 
differential, Bessel’s, 440, 562 

Cauchy-Riemann, 256 
Laplace’s, 207, 243, 266, 378, 439 
Legendre’s, 440 
Poisson’s, 207 
solution, 441 
wave, 440 
linear (see Linear algebraic equations) 
Error, 483, 502 
function, 544 
Essential singularities, 296 
Euler, 285 
Even functions, 340, 341, 398, 412, 450 
harmonics, 454 
Expansion (see also Series) 
continued fraction, 403 
of functions in series, 286, 290, 305, 307, 
415, 441 

orthogonal, 442 

partial fraction, 307, 415 

Exponential functions, in conformal map- 

ping, 379, 391 

Fourier series, 460, 531 

orthogonality, 444 

singularities, 296 


F 


Fejér polynomials, 496 
relation to Fourier series, 497 
uniform convergence of, 500 

Field, electric, 184, 349, 387 
force, 183, 200 
lamellar, 185 
map, 387, 391 
potential, 185, 200, 218, 221, 231, 234, 263, 

333, 351 

scalar, 183 
source, 185 
source-free, 185, 207, 218, 265 
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Field, vector (see Vector field) 
vortex, 186 
Filamental source distribution, 204 
Fischer-Hinnen method, 570 
Flow lines, 184, 267 
source of, 185, 203, 388, 392 
map, 185, 388 
Flux density, magnetic, 349 
Force field, 183, 200 
gravitational, 200 
vector, 198, 208 
work due to, 198 
Formula, Cauchy’s integral, 272 
Schwarz-Christoffel, 380 
summation, 479 
Fourier analysis, 501, 519 
graphical, 501 
integral, 517 
alternative forms, 522, 523 
derivation from Fourier series, 
537 
elementary properties, 524 
Gibbs phenomenon, 522 
series, alternating component, 471 
approximation with, 483, 485, 501 
coefficients, 449, 460, 472 
complex, 460, 472 
convergence, 459, 464 
cosine, 450 
definition, 448 . 
Dirichlet conditions, 463, 511, 520, 527 
for even function, 450 
exponential form, 460 
fundamental period, 452 
Gibbs phenomenon, 495 
harmonics, 453, 454, 503 
least-squares property, 483 
for odd function, 450 
product of, 477 
rectangular wave, 470, 475 
relation to Bessel functions, 506 
relation to Fejér polynomials, 497 
relation to Fourier integral, 518, 537 
relation to Laurent series, 459 
saw-tooth wave, 456, 464 
sine, 450 
summation, 479 
triangluar wave, 471 
in two variables, 511 
value at a discontinuity, 464, 485, 500 
spectrum, 473 
synthesis, 519 


518, 
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Fourier transform, inverse, 473, 519 
of a product, 528 
Fractional function (see Linear fractional 
transformation) 
Frequency, 410, 452, 523 
domain, 472, 519 
groups, 512 
spectrum, 473, 514, 517 
Functions, altitude, 183, 195, 351 
Bessel, 440, 506, 561 
characteristic, 111 
complex variables (see Complex variables) 
conjugate potential, 333, 351 
cylinder, 440, 507, 562 
energy, 156, 184 
envelope, 515 
error, 544 
even, 340, 341, 398, 412, 450 
_ expansions in series, 286, 290, 305, 307, 415 
exponential, time, 531 
fractional (see Linear fractional trans- 
formation) 
Hankel, 508 
mapping (see Conformal mapping) 
Neumann, 564 
odd, 340, 341, 398, 412, 450 
orthogonal, 442, 459, 501 
orthonormal, 442 
periodic, 452, 477, (see also Fourier series) 
permanence of form of, 290 
positive real (see Positive real functions) 
potential (see Potential function) 
proper, 439, 440 
pseudoscalar, 184 
pulse, 474, 494, 499, 521, 536 
real variable, integration of, 432 
scanning, 488, 498 
sequence of, 285 
singularity, 531, 544, 553 
spectrum representation, 473, 519 
transient, 517 
trigonometric, 439 
vector (see Vector function) 
Functions of a complex variable, algebraic, 
341, 318, 321. 
analytic, 257 
branches, 318 
classification, 295, 317 
continuity, 258 
definition, 254, 256 
derivative, continuity, 257 
existence of, 257, 275 


Functions of a complex variable, derivative, 


order, 275 
uniqueness, 255 
differential condition equations, 256 
differentiation, 254, 275 , 
entire, 297, 321 
graphical representation, 253, 258, 267 
holomorphic, 257 
identity theorem, 290 
impedance, 359, 410, 416 
integral, 297, 548 
integration (see Integration) 
inverse, 260, 321, 326 
logarithm, 310, 313, 325 
mapping, conformal (see Conformal map- 
ping) 
isogonal, 259 
maximum and minimum values in a region, 
328, 334 
maximum modulus, principle of, 328, 36 
417 
mermorphic, 297, 321 
multivalued, 261, 289, 296, 310, 317, 321, 
326 
natural boundary, 288 
positive real (see Positive real functions) 
rational, 297, 307, 321, 359, 398, 410 
reciprocal, 305, 315, 360 
regular, 257 
relation between real and imaginary parts 
of, 256, 333, 339, 351 
schlict, 365 
singularities, algebraic, 318 
branch points, 315, 319 


definition, 257 

effect on convergence of Taylor’s series, 
287, 288 

essential, 296 


infinitely dense, 288, 291 
integration around, 290, 304 
interpretation as vortexes, 265, 351 
isolated, 291 
logarithmic, 296, 345 
poles, 296, 323 
residue at, 304 
series expansions about, 290, 319 . 
types, 295 

transcendental, 297, 321 

uniqueness, 290 

uniqueness theorem, 290 


Fundamental component of a periodic func- 


tion, 453 


582 


Fundamental law of algebra, 325 
metric tensor, 92, 151 
period, 452 
properties of determinants, 1 


G 


Gaussian plane, 253 
Gauss’s law, 203, 217, 264 
Gibbs notation for vectors, 188, 190 
phenomenon, 495, 522, 535 
Gradient, 183, 195, 226 
curl of, 213 
in curvilinear co-ordinates, 240 
definition, 196 
divergence of, 206, 220 
notation, 197 
operator, 226, 230, 232 
of a scalar product, 230. 
of the vector r, 233 
Gramian determinant, 146, 151 
Gravitational force, 200 


H 


Hamiltonian operator, 197, 203, 214 
Hankel functions, 508, 563, 568 
Harmonics, amplitudes, 453, 502 
analysis, 501 
even, 454 
odd, 454 
phase angles of, 453, 462, 526 
spherical, 440 
Heaviside Operational Calculus, 541 
Hilbert, D., 142, 562, 569 
transforms, 339 
application to network theory, 346 
degenerate forms, 343 
Holder, 285 
Holomorphic functions, 257 
Homographic transformation, 364 
Hurwitz criteria, 395 
polynomial, definition, 395 
even and odd parts, 398, 412 
importance in positive real functions, 411 
properties, 397, 399, 401 
test for Hurwitz character, 401, 403 
determinantal form, 405 
zeros, 395 
Hydrodynamic analogy, 184, 201, 204, 208, 
216, 263, 300, 388 
Hyperbolic transformation, 367 
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Identity matrix, 38 
theorem, 290 
Image, 331, 361 . 
Impedance function, 359, 410, 416, 48 
Impulse, unit, 539 
Independence, linear, algebraic equations, 
18, 100 
vector fields, 186 
vector set, 77, 100, 114, 143 
Inertia, law of, 148 
Infinite series (see Series, infinite) 
Infinity, point at, 262 
integration around, 307, 549 
vertex of polygon at, 384 
Initial conditions, 441 
Inner product, 188 
Integrability, 463 
Integral, Cauchy principal value of, 339 
formula, Cauchy’s, 272 
Fourier, 517, 548 
derivation from Fourier series, 518 
functions, 297 
Laplace’s, 548 
law, Cauchy’s, 267 
line, 198, 267, 311 
Poisson’s, 330, 379 
sine-, 491, 501, 521, 534 
Sommerfeld, 507, 562 
Integration, contour, 199, 204, 223, 264, 268, 
548 : 
functions of a real variable, evaluation, 
432 
vector function, 199, 264 
functions of a complex variable, across a 
branch cut, 316, 560 
around a branch point, 310, 316, 560 
around the point at infinity, 307, 549 
around singularities, 290, 304, 382, 549, 
552, 560 
Cauchy’s integral formuia, 272 
Cauchy’s integral law, 267, 548 
Cauchy’s residue theorem, 302 
independence of path, 268 
over a regular region, 267, 290 
power series, 284, 303 
saddle-point method, 565 
semicircular detour, 382, 549, 552, 560 
line, 198, 267, 311 
surface, 201, 204, 209, 215 
volume, 204, 230, 264 
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Inverse Fourier transform, 473, 519 

function of a complex variable, 260, 321, 
326 

- linear transformation, 15, 39, 81 
matrix, 39, 44, 47, 57 

Inversion, graphical, 331, 361, 
matrix, 41, 50, 63 

Trrotational field, 185 

Isogonal mapping, 259 

Iterated quadratic form, 155 


J 


jp-axis, 253 
Jacobian, 26, 237 


L 


Lagrange’s identity, 245 
Lagrangian multiplier, 143, 211 
Lamellar field, 185 
Laplace’s development of determinants, 7 
equation, 207, 243, 378, 439 
in Cartesian co-ordinates, 207 
in curvilinear co-ordinates, 243 
solution, 207 
in two dimensions, 266, 378 
integral, 548 
transform, 548 
Laplacian, 207, 220 
in curvilinear co-ordinates, 243 
operator, 206, 233 
Latent roots, associated with a positive defi- 
nite quadratic form, 149, 171 
associated with a quadratic form, 139, 149, 
157, 162, 171 
definition, 62 
distinctness of, 112, 141 
invariance with collineatory transforma- 
tion, 118 
of one matrix with respect to another 
matrix, 158, 162 
power of a matrix, 155 
relation to quadric surfaces, 139 
separation property of, 174 
symmetrical matrix, 121 
Laurent series (expansion), 290, 305, 319, 
324, 410, 459 
ascending part, 294 
convergence, 293 
descending part, 294 
integration of, 303 
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Laurent series (expansion), near an algebraic 
singularity, 319 
principal part, 294, 308 
relation to Fourier series, 459 
relation to Taylor’s series, 290, 294 
Law of inertia, 148 
Least-squares approximation, 483, 501 
Left-handed co-ordinate system, 84, 184 
Legendre’s equation, 440 
Limit point, 276 
Line integral, 198, 267, 311 
spectrum, 473, 514 
transmission, 441, 452 
Linear algebraic equations (see also Linear 
transformation) 
equivalent set, 64 
homogeneous, 17 
independence, 18, 100 
inverse set, 14, 39, 81 
solution, Cramer’s rule, 15 
existence of, 16, 40, 100, 104, 115 
use of determinants, 13 
use of matrices, 39, 49, 63, (see also 
Matrix, inversion) 
Linear dependence and independence, of 
algebraic equations, 18, 100 
of vector fields, 186 
of a vector set, 77, 100, 114, 143 
Linear form, 133 
Linear fractional transformation, 363, 397, 
417 
fixed points of, 365 
interpretation on complex sphere, 376 
limitations of, 337, 365, 376 
properties of, 367, 373, 375 
of schlict functions, 365 
Linear transformation, 30, 76 (see also Linear 
algebraic equations) 
affine, 85 
congruent, 136, 149, 157 
effect on quadratic form, 135 
inverse, 15, 39, 81 
matrix of, 30, 54, 76, 136, 146, 157 
of a matrix, 54, 76, 136, 146, 149, 157, 
227 
orthogonal, 62, 81, 137, 156 
quadratic form of, 132, 151, 161 
vector significance of, 79, 186 
of vectors, 186, 227 - 
Logarithm, of a function of a complex vari- 
able, 310, 325 
principal value of, 313 
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Logarithmic branch point, 319 
singularity, 296, 345 


M 


M.LT. Staff, 185, 207 
MacLane, S., 142 ; 
Maclaurin’s series, 287, 330, 352 
Magnetic flux density, 349 
Map, contour, 196 
field, 387, 391 
flow, 185, 388 
Mapping, conformal (see Conformal mapping) 
function, 380 
isogonal, 259 
surfaces, 195 
Matrix, addition, 35 
adjoint, 42, 47 
augmented, 64 
canonical form, 61 
characteristic, 111, 141 
characteristic equation, 62, 111 
characteristic function, 111 
characteristic values, 62, 111 
column, 32 
conformability, 36, 49 
congruent transformation of, 136 
conjugate, 43 
definition, 30 
degenerate, 61, 105, 109, 137 
degree of degeneracy, 61 
determinant of, 31, 40, 137 
sign, 45, 84 
diagonal, 38, 42 
division by, 40 
Eigenwerte of, 140 
equality of, 32 
equivalence, 58 
identity, 38 
inverse, 39, 44, 47, 57 
inversion, 41, 50, 63 
latent roots (see Latent roots) 
of a linear transformation, 30, 54, 76, 136, 
140, 157 
modal, 116, 149, 155, 162 
multiplication, 31, 35 
nonsingular, 39 
null, 107 
nullity, 62, 109 
order, 31 
orthogonal, 45, 48, 57, 62, 84, 137, 148, 
166, 237, 442 
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Matrix, partitioned, 48 
powers of, 120, 155 
Cayley-Hamilton theorem, 118 
product, 31, 35 
null, 107 
proper values, 140 
of a quadratic form, 132 
rank, 39, 43, 61, 105, 114, 159 
reciprocal, 45, 48 
reduction to diagonal form, 59, 111, 137, 
146, 152 : 
relation to quadratic form, 137, 143 
row, 31 
scalar, 38 
singular, 39, 47 
skew symmetrical, 39 
submatrices, 48 
symmetrical, 39, 115 
of tensors, 187 
transformation, 54, 76, 136, 146, 149, 157, 
227 
transpose, 43, 46 
triangular, 67, 152, 154 
unit, 38, 54 
use in solving linear equations, 39, 49, 63 
Maximum modulus, principle of, 328, 334, 
417 
Mean square value, 478, 483, 530 
Membrane, 441 
Mermorphic functions, 297, 321 
Minor, 4, 19, 59 (see also Cofactor) 
complement of, 5 
principal, 5, 154, 406 
Modal matrix, 116, 149, 155, 157, 162 
Modulation, 526 
Moment, dipole, 353 
vector, 126, 354 
Multiplicity of poles and zeros, 296, 298 
Multiply connected region, 221, 223, 271 
Multivalued functions, 221,.261, 289, 296, 
310, 317, 321, 326 
potential, 221 
Schwarz-Christoffel transformation, 382 


N 


n-dimensional space, 76, 90, 134, 167 
Natural boundary, 288, 422 
Network, lossless, 416, 555 

theory, 158, 346, 359, 410, 416, 477, 481 
Neumann functions, 564 
Nonturbulent field, 185, 187, 213, 265 
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Normal co-ordinates, 111 
derivative, 196 
form, of a quadratic form (see Canonic 
form) 
Normalization of functions, 442 
of vectors, 89 
Null matrix, 107 
Nullity, 62 
Sylvester’s law of, 109 
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Oblique co-ordinates, 85, 97, 159 
Odd functions, 340, 341, 398, 412, 450 
harmonics, 454 
Operator, gradient, 226, 230, 232 
Hamiltonian, 197, 203, 214 
Laplacian, 206, 233, 439 
Order, determinant, 1 
matrix, 31 
tensor, 187 
Orthogonal circles, in linear fractional trans- 
formation, 366 
co-ordinates, 81, 85, 97, 137, 140, |165, 184, 
234 (see also Co-ordinates, Cartesian) 
expansions, 442 
families of curves, 267 
functions, 442 
linear transformation, 62 81, 137, 156 
matrix, 45, 48, 57, 62, 84, 137, 148, 166, 
237, 442 
polynomials, 442 
vectors, 81, 84, 107, 121, 168, 189, 190, 266 
vector sets, 81, 121, 168 
Orthogonality, conditions, 442 
exponential functions, 444 
trigonometric functions, 444 
Orthonormal functions, 442 


P 


p-x. function (see Positive real functions) 
Parallelepiped, curvilinear, 241 
volume, in terms of vector product, 194 
Parallelogram, area of, in terms of vectors, 
190 
law of addition, 188 
Partial fraction expansion, 307, 415 
Partitioned matrix, 48 
Periodic functions, 452 (see also Fourier series) 
alternating component, 471 
beating, 513 


Periodic functions, effective value, 478 
harmonics, 453 
mean square value, 478 
product, 477 
Permanence of form of functions, 290 
Phase angle, harmonics of periodic function, 
453, 462, 526 
spectrum, 473 
Plane, complex, 253, 262, 268 
constraint, 165 
z-plane, 258 
Point, branch, 296, 314, 319, 351 
cluster, 276 
of condensation, 276 
fixed, 365 
at infinity, 262, 307, 384 
limit, 276 
saddle, 298, 321 
set, 276 
singular, 296 
source, 204 
of stagnation, 298 
stationary, 144, 172 
winding, 314 
Poisson’s equation, 207 
integrals, 330, 379 
Polar vector, 183 
Poles, 296, 323 
detection of, 326 
multiplicity, 296 
of a positive real function, 410, 414 
separation property, 400 
Polygon, mapping of, 383 
vertex angles, 385 
Polynomial, conformal mapping, 397 
differentiability, 257 
Fejér, 496 
Hurwitz, 395, 401, 411 
orthogonal, 442 
in rational fractions, 308, 359, 398, 410 
trigonometric, 436 
Tschebyscheff, 431 
zeros of, 308, 324 
Positive definite quadratic. form, 144, 150, 
164, 171 
latent roots of, 149, 171 . 
Positive real functions, definition, 409 
polar form of, 419 
poles and zeros, 410, 414 
properties, 411, 412, 417, 419 
relation to Hurwitz polynomials, 411 
residues, 411, 414 
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Potential energy, 200, 222 Quadratic form, reduction tc sum of squares, 
field, 185, 200, 218, 221, 231, 234, 263, 333, single form, 137, 146 
351 two forms, 156, 159 
function, conjugate, 333, 351 signature, 148 
multivalued, 221 stationary points of, 144, 172 
scalar, 183, 195, 199, 206, 221, 439 Quadratic surface (see Quadric surface) 
vector, 218, 263, 266 Quadric surface, central, 135 
theory, 330, 349, 439 ellipsoidal, 138, 144, 149, 156, 165 
dynamic, 439 principal axes of, 137, 156, 159 
Power, average, 478, 530 relation of latent roots to, 139 
factor, 478 semiaxes, 140, 144 
product, 477, 428 vector interpretation, 134, 137, 145, 159 
series (see Series, power) Quadripole, 354 
Principal axes, 137, 156, 159 : 
minor, 154, 406 R 
part of Laurent series, 294, 308 
value of an integral, 339 r-vector, 231 
value of logarithm, 313 curl, 234 
Projection, stereographic, 262, 372 divergence, 233 
Proper functions, 439, 440 gradient, 233 
values of a matrix, 140 Radius of convergence, 283 
values of wave equation, 441 Rank, determinant, 18, 39, 61, 78 
Pseudoscalar functions, 184 matrix, 39, 43, 61, 105, 114, 159 
Pulse functions, 474, 494, 499, 521, 536 _ quadratic form, 132, 137, 146, 148 
Fourier transform of, 474, 521, 536 vector set, 78, 101 
Ratio test, 278 
Q Rational fraction, 308, 359, 398, 410 
functions, 297, 307, 321 
Quadratic form, 132 Reactance, 416 
abridged form, 165 Reciprocal co-ordinates, 90 
associated latent roots, 139, 149, 157,162,171 function, 305, 315, 360 
associated linear transformation, 132, 151, matrix, 45, 48 
161 3 Rectangular co-ordinates (see Cartesian co- 
bilinear form, 134 ordinates) 
canonic form of, 150, 154, 156, 159, 172 Reflection, 361 
degeneracy, 137 Region of analyticity, 257, 288 
discriminant of, 132, 137, 146, 154 integration around, 268, 290 
effect of constraints upon, 165, 172 multiply connected, 221, 223, 271 
effect of co-ordinate transformation upon, simply connected, 221, 222, 271 
135, 165 Regular function, 257 


effect of linear transformation upon, 135 Residue, definition, 304 
geometrical interpretation, 134, 159, 165 evaluation, 293, 302, 305, 308 


(see also Quadric surface) Cauchy’s theorem, 302 

iterated, 155 in positive real functions, 411, 414 

law of inertia, 148 Riemann, 256, 281, 507 

matrix of, 132 surfaces, 289, 310, 318 

normal form (see Canonic form) visualization of, 315, 317, 321 

positive and negative definite, 144, 149, | Right-handed co-ordinate system, 84, 184 
150, 164, 171 Ring, vortex, 216 

rank, 133, 137, 146, 148 Root-mean-square value, 478 


reduction to sum of squares, more than | Roots (see Zeros) 
two forms, 164 Rotational field, 185 
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Rouché’s theorem, 327 Fourier, 448 
Routh, 395, 407 geometric, 436 
stability criteria, 395 infinite, 276 


Row matrix, 31 
Rule of alternatives, 105 


S 


Saddle point, 298, 321 
conformal mapping near, 300 
method of integration, 565 
Saw-tooth wave, 456, 464 
Scalar field, 183 
potential function, 183, 195, 199, 206, 221 
product of vectors, 80, 122, 184, 188, 228, 
266 
gradient of, 230 
triple product, 193, 198 
Scanning function, 488, 498 
Schlicht functions, 365 
Schwarz-Christoffel formula, 380 
transformation, 378 
critical points, 382 
examples, 387 
multivaluedness, 382 
Schwarz’s lemma, 327 
Screw rule, 183, 188, 209, 211, 214, 217 
Semiaxes, 140, 144 
Semicircular detour, in contour integration, 
382, 549, 552, 560 
Separation property, of zeros and poles, 400 
Sequence, arithmetic mean, 285 
of functions, 285 
of points, 276 
Series, Cesdro sum, 285 
convergence, absolute, 277 
Cauchy’s principle of, 277, 466 
circle, 283, 288 
comparison test, 279 
conditional, 281 
d’Alembert ratio test, 278 
Dedekind test, 282 
at a discontinuity, 464, 485, 500 
Fourier, 459, 464 
Laurent, 293 
Maclaurin, 287 
radius of, 283 
Taylor, 287, 288 
uniform, 283 
dominant, 279 
expansions of functions in, 286, 290, 305, 
307, 403, 415, 441 


Laurent, 290, 305, 319, 324, 410, 459 
Maclaurin, 287, 330, 352 
power, 280, 283 
differentiation and integration, 284, 303 
expansions of functions in, 286, 290, 305 
summation, 479 
Taylor, 287, 305, 322, 323 
in two variables, 511 
Signature, 148 
Simply connected region, 221, 222, 271 
Sine-integral, 491, 501, 521, 534 
Singular matrix, 39, 47 
point, 296 
Singularities, algebraic, 318 
branch points, 315, 319 
definition, 257 
effect on convergence of Taylor’s series, 
287, 288 
essential, 296 
exponential function, 296 
infinitely dense, 288, 291 
integration around, 290, 304, 382, 548, 552, 
560 
interpretation as vortexes, 265, 351 
isolated, 291 
logarithmic, 296, 345 
poles, 296, 323 
residue at, 304 
evaluation, 293, 302, 305, 308 
series expansions about, 290, 319 
types, 295 
Singularity functions, 531, 544, 553 
Sink, 185, 203, 388, 392 
Skew-symmetrical matrix, 39 
Solenoidal field, 185 
Sommerfeld, 507 
integral, 507, 562 
Source (see also Vortex) 
density, 215 
distribution, 200, 206, 291 
filamental, 204 
idealized, 204 
field, 185 
of flow lines, 185, 203, 388, 392 
point, 204 
voltage, 478 
Source-free field, 185, 207, 218, 265 
Spectrum, amplitude, 473 
continuous, 517 
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Spectrum, Fourier, 473 
frequency, 473, 514, 517 
line, 473, 514 
phase, 473 
Sphere, complex, 262, 317, 376 
Spherical harmonics, 440 
polar co-ordinates, 234, 239 
Square wave, 470 
Stability criteria, 395 
Stagnation, point of, 298 
Stationary point, 144, 172 
Step, unit, 533, 541, 545, 551 
Stereographic projection, 262, 317, 376 
Stieltjes continued fraction, 403 
Stokes’s law, 214, 223, 265, 270 
Stratum, double, 357 
Struik, D. J., 113 
Submatrices, 48 
Summation, Cesaro, 285, 496 
Fourier series, 479 
Surface, associated with quadratic form, 134 
constant-value, 195 
curl, 217 
divergence, 206 
integration, 201, 204, 209, 215 
mapping, 195 
quadric (see Quadric surface) 
Riemann, 289, 310, 318 
visualization of, 315, 317, 321 
of source distribution, 204 
Sylvester’s law of nullity, 109 
Symmetrical determinant, 15 
matrix, 39, 115 
latent roots of, 121 
transformation, 121, 133 


T 


Taylor’s series, 287, 305, 322, 323 
region of convergence, 287, 288 
relation to Laurent series, 290, 294 

Tensor, 92, 151, 187, 227 
components, 187 
fundamental metric, 92, 151 
matrix of, 187 : 
notation, 99 
order, 187 
valence, 187 

Thread, vortex, 216 

Time differentiation, 225, 439 
domain, 472, 519 
series (see Fourier series) 

Titchmarsh, E. C., 339, 340 
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Torque, 183, 208 
Transcendental functions, 297, 321 
Transform, Fourier, 473, 519 
inverse, 473, 519 
Hilbert, 339 
application to circuit theory, 346 
degenerate forms, 343 
Laplace, 548 
Transformation, bilinear, 363 
collineatory, 117, 137 
congruent, 136, 149, 157 
co-ordinate (see Co-ordinates, transforma- 
tion of) 
elementary, of matrices, 54, 146 
elliptic, 368 
fractional (see Linear fractional trans- 
formation) 
group, 373 
homographic, 364 
hyperbolic, 367 
linear (see Linear transformation) 
matrix (see Linear transformation) 
orthogonal, 62, 81, 137, 140, 156, 166, 172, 
184, 234 
Schwarz-Christoffel, 378 
symmetrical, 121, 133 
vector, 186, 227 
Transient functions, 517 
convolution of, 530 
effective value of, 530 
mean value of, 530 
Transmission line, 441, 452, 555 
Transpose, of a matrix, 43, 46 
of a vector set, 77, 163 
Triangular form of determinant, 7 
matrix, 67, 152, 154 
wave, 471 
Trigonometric functions, 439 
orthogonality of, 444 
polynomial, 436 
series (see Fourier series) 
Triple product of vectors, scalar, 193, 198 
vector, 192 
Triplet, unit, 543, 546 
Tschebyscheff polynomials, 431 
Tube, 185, 230. 
Turbulent field, 185, 186, 208, 215 


U 


Uniform convergence, 283 
abscissa of, 575 
Fejér polynomials, 500 
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Uniform convergence, Fourier series, 465 
tolerance, 502 
Uniqueness theorem, 290 
Unit circle, 284, 361, 369, 417, 419 
doublet, 542, 546 
impulse, 539, 545 
matrix, 38, 54 
step, 533, 541, 545, 551 
triplet, 543, 546 
vectors, 85, 167, 217 
in curvilinear co-ordinates, 235 
in oblique co-ordinates, 87 
scalar product of, 189 
vector product of, 192 


vV 


Valence of a tensor, 187 
Variables, complex (see Complex variables) 
functions of (see Functions of a complex 
variable) 
contragredient, 95 
Vector analysis, 183 
axial, 183 
constraint, 166 ; 
contravariant and covariant components, 
; 95, 159 
curl, 208, 213, 217, 229, 234, 243 
definition, 183 
del, 197 
derivative, 225, 232 
differential, 198, 201, 235 
differentiation, 225, 232 
direction cosines, 82, 101, 112, 162 
direction parameters, 126 
divergence, 200, 206, 228, 233, 242, 264 
field, irrotational, 185 : 
lamellar, 185 
nonturbulent, 185, 187, 213, 265 
potential, 185, 200, 218, 234, 263 
rotational, 185 
solenoidal, 185 
source-free, 185, 207, 218, 265 
turbulent, 185, 186, 208, 215 
force, 198, 208 
function, 218, 228, 263 
linear, 228 
potential, 218, 263, 266 
Gauss’s law, 203, 217, 264 
Gibbs notation, 188, 190 
Hamiltonian operator, 197, 203, 214 
interpretation of a complex number, 253 


Vector interpretatoin of a linear transforma- 


tion, 79, 186, 227 
of a quadric surface, 134, 137, 145, 159 
moment, 126, 354 , 
normalization of, 89 
orthogonal, 81, 84, 107, 121, 168, 189, 190, 
266 
polar, 183 
product of vectors, cross, 190 
determinantal form of, 192, 194 
dot, 188 
inner, 188 
scalar, 80, 122, 184, 188, 228, 266 
scalar triple, 193, 198 
triple vector, 192 
vector, 184, 190, 228 
r, 231 
set, associated with quadric surface, 134, 
137, 145, 159 
inconsistent, 104 
linearly dependent, 77, 100, 114, 143 
linearly independent, 77, 145 
orthogonal, 81, 121, 168 
rank, 78, 101 
transposed, 77, 163 
Stokes’s law, 214, 223, 265, 270 
time-varying, 225 
unit, 85, 167, 217 
in curvilinear co-ordinates, 235 
.in oblique co-ordinates, 87 
length, 88 
scalar product, 189 
vector product, 192 
useful relations, 228, 231 
use of vectors in interpreting linear equa- 
tions, 79, 100, 132 


Versors, 127 
Vertex angles of a polygon, 385 
Vortex, definition, 186 


density, 208, 215, 221 

distribution, 215, 219, 291 

field, 186 

representation by a singularity, 265, 
351 

ring, 216 

thread, 216 


w-plane, 258 
Wave equation, 440 


proper values of, 441 
motion, 439 
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Weber, 507 Zeros, 297, 298, 323, 395 
Winding point, 314 conformal mapping near, 300 
Work, evaluation of, by line integral, 198, 208 detection of, 325, 408 
Wronskian, 25 of a Hurwitz polynomial, 395 


multiplicity of, 298 

of a polynomial, 308, 324, 395 
g-plane, 258 of a positive real function, 411, 414 
_ associated complex sphere, 262 separation property of, 400 


Z. 


| a 
JOHN WILEY 
& SONS, INC, 


